US007795388B2 


(i2) United States Patent 

Paavola et al. 


(io) Patent No.: US 7,795,388 B2 

( 45 ) Date of Patent: Sep. 14, 2010 


(54) VERSATILE PLATFORM FOR 

NANOTECHNOLOGY BASED ON CIRCULAR 
PERMUTATIONS OF CHAPERONIN 
PROTEIN 

(75) Inventors: Chad D. Paavola, Mountain View, C A 
(US); Jonathan D. Trent, Watsonville, 
CA (US); Suzanne L. Chan, Oakland, 
CA (US); Yi-Fen Li, Sunnyvale, CA 
(US); R. Andrew McMillan, San 
Francisco, CA (US); Hiromi Kagawa, 
Sunnyvale, CA (US) 

(73) Assignee: The United States of America as 

represented by the Administrator of 
the National Aeronautics and Space 
Administration (NASA), Washington, 
DC (US) 

( * ) Notice: Subject to any disclaimer, the term of this 

patent is extended or adjusted under 35 
U.S.C. 154(b) by 362 days. 

(21) Appl.No.: 11/194,991 

(22) Filed: Aug. 1, 2005 

(65) Prior Publication Data 

US 2006/0084792 Al Apr. 20, 2006 


JP 

11-045990 

2/1999 

JP 

11-204774 

7/1999 

wo 

WOOO/77196 Al 

12/2000 

wo 

WO 03/080796 

10/2003 


OTHER PUBLICATIONS 

Bosch et al., “Crystal structure of the beta-apical domain of the 
thermosome reveals structural plasticity in the protrusion region”, 
Journal of Molecular Biology 301 : 19-25 (Aug. 4, 2000).* 
Archibald, et al., “Recurrent parology in the evolution of archaeal 
chaperonins” 1999 Current Biology 9: 1053-1056. 

Beernink, et al., “Random circular permutation leading to chain 
disruption ...” 2001 Protein Sci 10:528-537. 

Brown, et al., “A Genetic Analysis of Crystal Growth” 2000 J. Mol. 
Biol. 299: 725-735. 

Brown “Metal -recognition by repeating polypeptides” 1997 Nature 
Biotechnol. 15: 269-272. 

Bruchez, et al., “Semiconductor Nanocrystals as Fluorescent Bio- 
logical Labels” 1998 Science 281: 2013-16. 

Charlebois, et al., “ Sulfolobus genome: from genomics to biology” 
1998 Curr Opin in Microbio 1 : 584-588. 

Douglas and Clark “Nanometer molecular lithography” 1986 Appl 
Phys Lett 48:676-678. 

Ellis, et al., “Two-Dimensional Crystallization of the Chaperonin 
TF55 from the Hyperthermophilic Archaeon Sulfolobus 
solfataricus ” 1998 J. Struc. Biol. 123, 30-36. 

Fenton “GroEL-mediated protein folding” 1997 Protein Science 6: 
743-760. 

Furutani, et al., “Group II Chaperonin in a Thermophilic 
Methanogen, Methanococcus thermolithotrophicus ” 1998 J. Biol. 
Chem. 273: 28399-28407. 


Related U.S. Application Data 

(63) Continuation-in-part of application No. 10/494,853, 
filed as application No. PCT/US02/35889 on Nov. 8, 
2002. 


(Continued) 

Primary Examiner — Anand U Desai 

(74) Attorney, Agent, or Firm — John F. Schipper; Robert M. 

Padilla 


(60) Provisional application No. 60/340,538, filed on Nov. 

8 , 2001 . 


(57) 


ABSTRACT 


(51) Int.Cl. 

C07K 14/00 (2006.01) 

A61K 38/00 (2006.01) 


(52) U.S. Cl 530/350; 514/2 

(58) Field of Classification Search None 


See application file for complete search history. 
(56) References Cited 

U.S. PATENT DOCUMENTS 


5,428,131 

A 

6/1995 

6,338,952 

B1 

1/2002 

6,858,318 

B2 

2/2005 

2002/0130353 

Al 

9/2002 

2003/0078373 

Al 

4/2003 

2005/0130258 

Al* 

6/2005 


Trent et al . 

Young 
Kogiso et al. 

Lieber et al. 

Fersht et al. 

Trent et al 435/68.1 


FOREIGN PATENT DOCUMENTS 


The present invention provides chaperonin polypeptides 
which are modified to include N-terminal and C-terminal 
ends that are relocated from the central pore region to various 
different positions in the polypeptide which are located on the 
exterior of the folded modified chaperonin polypeptide. In the 
modified chaperonin polypeptide, the naturally-occurring 
N-terminal and C-terminal ends are joined together directly 
or with an intervening linker peptide sequence. The relocated 
N-terminal or C-terminal ends can be covalently joined to, or 
bound with another molecule such as a nucleic acid molecule, 
a lipid, a carbohydrate, a second polypeptide, or a nanopar- 
ticle. The modified chaperonin polypeptides can assemble 
into double-ringed chaperonin structures. Further, the chap- 
eronin structures can organize into higher order structures 
such as nanofilaments or nanoarrays which can be used to 
produce nanodevices and nanocoatings. 


JP 


07-067641 


3/1995 


10 Claims, 62 Drawing Sheets 



US 7,795,388 B2 

Page 2 


OTHER PUBLICATIONS 

Galagan, et al., “The Genome of M. acetivorans Reveals Extensive 
Metabolic and Physiological Diversity” 2002 Genome Research 12: 
532-542. 

Griesbeck, et al., “Reducing the Environmental Sensitivity ofYellow 
Fluorescent Protein” 2001 J. Biol. Chem. 276:29188-29194. 

Guex, et al., “Protein modeling for all” 1999 TiBS 24: 364. 

Hard and Hayer-Hartl “Molecular Chaperones in the Cytosol: from 
Nascent Chain to Folded Protein” 2002 Science 295: 1852-1858. 
Hard “Molecular chaperones in cellular protein folding” 1996 Nature 
381:571-580. 

Heinemann and Hahn “Circular Permutation of Polypeptide Chains: 
Implications for Protein Folding and Stability” 1995 Prog Biophys 
Mol Biol 64:121-143. 

Horwich, et al., “Protein folding in the cell: functions of two families 
of molecular chaperone. hsp60 andTF55-TCPl” 1993 Phil. Trans R. 
Soc.Lond.B 339: 313-326. 

Iwakura, et al., “Systemic circular permutation of an entire protein 
reveals essential folding elements” 2000 Nat Struct Biol 7:580-585. 
Kagawa, et al., “The 60 kDa Heat Shock Proteins in the 
Hyperthermophilic Archaeon Sulfolobus shibatae ” 1995 J Mol Biol 
253:712-25. 

Kagawa, et al., “The composition, structure and stability of a group II 
chaperonin are temperature regulated ...” 2003 Molec Microbio 
48:143-156. 

Karlin, et al., “Characterizations of Highly Expressed Genes of Four 
Fast Growing Bacteria” 2001 J. Bacteriol. 183: 5025-5040. 

Klumpp and Baumeister “The thermo some: archetype of group II 
chaperonins” 1998 FEBS Letters 430: 73-77. 

Koeck, et al., “Two-dimensional crystals of reconstituted |3-subunits 
of the chaperonin TF55 from Sulfolobus shibatae ” 1998 Biochim 
Biophys Acta 1429: 40-44. 

Kroger, et al., “Polycationic Peptides from Diatom Biosilica That 
Direct Silica Nanosphere Formation” 1999 Science 286:1129-1132. 
Labas, et al., “Diversity and evolution ofthe green fluorescent protein 
family” 2002 Proc Natl Acad Sci USA 99:4256-4261. 

Lee, et al., “Ordering of Quantum Dots Using Genetically Engi- 
neered Viruses” 2002 Science 296:892-895. 

Lin, et al., “The unique hetero -oligomeric nature ofthe subunits in the 
catalytic cooperatively ...” 1997 Proc Natl Acad Sci USA 94: 
10780-10785. 

Marco, et al., “The molecular chaperone TF55” 1994 FEBS 341: 
152-155. 

McMillan, et al. , “Ordered nanoparticle arrays formed on engineered 
chaperonin protein templates” 2002 Nature Materials 1:247-252. 
Naik et at, “Biomimetic synthesis and patterning of silver 
nanoparticles” 2002 Nature Materials 1: 169-172. 

Patterson “Fluorescent protein spectra” 2001 Cell Sci 114:837-838. 
Peng, et al., “Shape control of CdSe nanocrystals” 2000 Nature 404: 
59-61. 

Quaite -Randall, et al., “Conformational cycle ofthe archaeosome, a 
TCP1- like chaperonin from Sulfolobus shibatae” 1995 J. Biol. 
Chem. 270: 28818-28823. 

Rizzo, et al., “An improved cyan fluorescent protein variant useful for 
FRET” 2004 Nature Biotechnology 22:445-449. 

Sarikaya, et al., “Molecular biomimetics: nanotechnology through 
biology” 2003 Nature Materials 2:577-585. 

Shaner “Improved monomeric red, orange and yellow fluorescent 
proteins derived from Discosoma sp. red fluorescent protein” 2004 
Nature Biotechnology 22: 1567-1572. 

Sleytr, et al., “Crystalline bacterial cell surface layers (S layers)” 
1999 Angew Chem Int Edn 38:1034-1054. 

Slocik, et al., “Monoclonal antibody recognition of histidine-rich 
peptide encapsulated nanoclusters” 2002 Nanoletters 2:169-173. 

Su, Wei-Wen “Fluorescent proteins as tools to aid protein produc- 
tion” 2005 Microbial Cell Fractions 4:12. 

Trent, et al., “A molecular chaperone from a thermophilic 
archaebacterium is related to the eukaryotic protein t-complex 
polypeptide- 1” 1991 Nature 354: 490-493. 


Trent, et al., “Chaperone filaments” The archaeal cytoskeleton ? 1997 
Proc. Nat. Acad. Sci USA 94: 5383-5388. 

Whaley, et al., “Selection of peptides with semiconductor binding 
specificity for directed nanocrystal assembly” 2000 Nature 405:665- 
668. 

Yamashita “Fabrication of a two-dimensional array of nano -particles 
using ferritin molecule” 2001 Thin Solid Films 393:12-18. 

Yaoi, et al., “Chaperonin Filaments: their formation and an evaluation 
of methods for studying them” 1998 Arch of Biochem and Biophys 
356:55-62. 

Bosch, et al., Crystal structure of the beta-apical domain of the 
thermosome reveals structual plasticity in the protrusion region, J. 
Molecular Biology, Aug. 4, 2000, 19-25. 

Kramer, et al., Engineered protein cages for nanomaterial synthesis, 
J. Am. Chem. Soc., 2004, 13282-13286, 126-41. 

Llorca, et al., 3D reconstruction of the ATP-bound form of CCT 
reveals the asymmetric folding conformation of a type I . . . , Nature 
Structural Biology, Jul. 1999, 639-642, 6-7. 

Pazirandeh, et al., Metallized nanotubes derived from bacteria, 
Biomimetics, 1992,41-50, 1-1. 

Schoehn, et al., Three conformations of an archael chaperonin, TF55 
from Sulfolobus shibatae, J. Molecular Biology, Feb. 2000, 813-819, 
296. 

Shpigel, et al., Production and Purification of a Recombinant Human 
hsp60 Epitope Using the Cell . . . , Protein Expression and Purifica- 
tion, 1998, 185-191, 14, Academic Press. 

Ursic, et al., The essential yeast Tcpl protein affects actin and 
microtubules, Molecular Biology ofthe Cell, Oct. 1994, 1065-1080, 
5, American Society for Cell Biology. 

Van Bommel, et al., Organic Templates for the Generation of Inor- 
ganic Materials, Angew. Chem. Int. Ed., 2003, 980-999, 42-9. 
Weiss, et al., A Mutant at Position 87 of the GroEL Chaperonin is 
Affected in Protein Binding and ATP Hydrolysis, J. Biological 
Chemistry, 13956-13960, 1995, 270-23. 

Office Action, dated Mar. 13, 2006, fromU.S. Appl. No. 10/494,853, 
filed May 6, 2004. 

Office Action, dated Dec. 14, 2006, fromU.S. Appl. No. 10/494,853, 
filed May 6, 2004. 

Dat, et al., Mimicking a conformational B cell epitope of the heat 
shock protein PfHsp70-l antigen of Plasmodium falciparum using a 
multiple antigenic peptide, Parasite Immunology 22(1 1): 53 5 -543, 
2000, Blackwell Science Ltd. 

Ishii, et al., Novel Protein — Forming Nanocluster Hybrid System: 
Stabilization of CdS Nanocluster Using Nanopore of Tubular Pro- 
tein, The Chemical Society of Japan, 80th Fall Meeting, Sep. 7, 2001, 
222, 1C1-04. 

Jiang, et al., Formation of Huge Rotaxane by Encapsulating Lumi- 
nescent Dendrimer Rod in Nanopore of Tubular Protein, The Chemi- 
cal Society of Japan, 80th Fall Meeting, Sep. 7, 2001, 209, 3BC-07. 
Miklos, et al., Primary structure and function of a second essential 
member of the heterooligomeric TCP1 chaperonin complex of yeast, 
TCP1|3, Proc. Natl. Acad. Sci., 91:2743-2747, 1994, USA. 

Shipway, et al., Nanoparticles as structural and functional units in 
surface-confined architectures, Chem. Commun., The Royal Society 
of Chemistry 2001, 20:2035-2045. 

Response to Nonfinal Action, dated Mar. 13, 2006, in U.S. Appl. No. 
10/494,853, filed May 6, 2004. Response filed Sep. 13, 2006. 
Response to Final Rejection, dated Dec. 14, 2006, in U.S. Appl. No. 
10/494,853, filed May 6, 2004. Response filed Jun. 7, 2007. 
Nonfinal Action, dated May 28, 2008, in U.S. Appl. No. 10/494,853, 
filed May 6, 2004. 

Response to Nonfinal Action, dated May 28, 2008, in U.S. Appl. No. 
10/494,853, filed May 6, 2004. Response filed Nov. 26, 2008. 

Final Rejection, dated Feb. 20, 2009, in U.S. Appl. No. 10/494,853, 
filed May 6, 2004. 

Response to Final Rejection, dated Feb. 20, 2009, in U.S. Appl. No. 
10/494,853, filed May 6, 2004. Response filed Aug. 19, 2009. 
Nonfinal Rejection mailed Oct. 19, 2009 in related U.S. Appl. No. 
11/653,479. 

* cited by examiner 





Section 


U.S. Patent 


Sep. 14, 2010 


Sheet 2 of 62 


US 7,795,388 B2 



</> 

SC 

)— S' O sc 

* 

CO 


t 

-J 

sc 

1 

1 


O < O C| — 

LlJ j LlJ I^S — I 2 

" 5 " 

“cT 


« 


H 

Li. 

\ < 


±±L 

J ±i 

< 


Ll. 


Q 

ojuj 



2 

^ 1 - 1 


or 


o 

t— 

< 

O 

o 


—J 

1 

—4 J ^ 1 

— i 

i 

2 

o 

2 

Q C 

U- 

u. 


Q 

1 

Q i o i 

E 

3 

cr 

Q 

2 

1 - 

[» 

>]~ 



1 

u. i oe i 

i 

u. 

00 

U- 



CO 

CO 


f>1 

1 

□ 'S' 

0> 

— 


O 


jz£. 



lS:J 

1 

fe] ' *■>* 1 

E 

it 

in 

►— 

X 

IE 

cl' 

‘cl jCL 



1 

O 1 OS 1 

>- 

'o 

2 

o 

2 

<C 

o’ 

"o 


o 

1 

O 1 c 1 

o 

< 

QC 

CO 

to 

I 

UJ 

UJ* 


UJ 

1 

< 1 — 1 I 

t — 

< 

CO 

cc 


<c 

2 

2 


CL 

1 

— « 1 sr 1 

< 

2 

00 

CO 

o 

DC 

1 

1 


1 — 

1 

2 1 <£ ) 

<t 

1 

> 

2 

CL 

00 

1 

1 

_ 

t— 

1 

112 1 

2 

1 

Or 

2 

2 

> 

} 

1 


< 

I 

1 1 O 1 


1 

CC 

U- 

s: 

CL 

1 

1 


> 

1 

I 1 O i 


1 

< 

_J 

s: 

QC 

1 

1 


1 — 

1 

1 1 O i 

1 

1 

z 

o 

CO 

2 

1 

t 


< 

1 

1 1 to l 

1 

1 

O 

CO 

CL 

O 

1 

1 


h- 

1 

i 1 UJ | 

I 

1 

< 

2 

2 

DC 

1 

1 


<C 

) 

1 1 CL. f 

1 


— « 

1 

1 

— 1 


1 


2 

( 

{ f cn i 


1 

QC 

1 


> 

J 

1 


) 

1 

I i i — - i 

J 

1 

C 

1 

t 

»— 

1 

\ 


1 

< 

1 { 2 \ 

i 

I 


1 

1 

Cl 

1 

1 

l _ 


1 

) 1 u l 

1 

J 

CO 

1 

1 

— j 


1 


1 

1 

t 1 CL 1 

1 

J 

< 

1 

1 

DC 

1 

1 


1 

1 

1 1 Hr | 

1 

J 

— J 

1 


l 

1 

t 


1 

1 

1 1 SC ! 

1 


z 

l 

1 

2 

1 

1 


1 

1 

11 — > 1 

1 

1 

CO 

1 

\ 

1 

1 

1 


\ 

1 

1 1 . LU | 

1 

1 

< 

1 

1 

I 

1 

1 


\ 

1 

1 1 m i 

1 

) 

U— 

1 

\ 

1 

1 

\ 


1 

1 

1 1 C I 

1 

1 

QC 

\ 

1 

I 

1 

1 


1 

1 

112 J 

I 

1 

>~ 

\ 

1 

i 

1 

1 

. — 

1 

1 

1 1 1 1 

1 

( 

2 

J 

1 

1 

1 

1 

(l) 

(l) 

0 ) 

0 ) 

(i) 

0 ) 

(D 

0 ) 

(i) 


T— 

(0 

(D 

(l) 

' w - 


<L> 


a 

cn 


<n 

x — 

o 

o 

“ 


o 

cn 

. — 

— 05 

c 

X— > 

o 

— 

o 

c 

tn 

C N 1 

o 

Jg 

o 

< — 

u 

o 

o 




_c 

o 

L. 

— 

fc 

— 

. — 


CL 

o 

o 

o 

CL 

o 

JC 

UJ 

O 

o 

> 

u 

4 

JC 

tn 


-o 

sz 

•* — 

u 

• 2 



] 

• «* 

u 


CL> 

<C 




u 

O) 

05 

JO 

I 


cd 

— J 

o 

c 

o 

3 

1 

< 


UJ 


>> 

o 


o 


1 

a 


cn 



O JC 

1 


u 

H— 


# 


-C CL 


o 

O 


O 

2 

2 

O. — 

o 



1 

to 



— O 

to 

05 



CL 

kj- 

1 

o 

05 

-O 


o 

CO 

d> 





.4-1 

X 

n 

05 e 

X 

in 


05 


cp 

to 

E o 


in 


JO 

— 

CL 

CL 

O tn 

L> 

U- 



o 

CO 

CO 

tn o 

o 



0J 


X 

X 

Q a 




a 

u 



e u 

t- 



o 

<u 



V- Q> 

x> 



CO 

— > 



05 _C 

c 




o 



JC 

o 



a 

o 



4J 

xz 



L_ 

_o 




L> 



<u 

o 




o 




c 




—4 



—i 

o 








>% 




a 




o 







Q. O O O. CL 

u - - o c 

h“ V. L. » — t— 

-o -O 
c c 

o o 

JC JC 

O L> 

o o 


a a 


</> 

UJ cc 

x> 

o =3 

to 2 
cn 


o 




i 


CO 


5 8 
3 o 

Q m o 


fo 


CO 

I 

2 

o 


Oj[C£jO O 


FIG.2A 



Section 


U.S. Patent 


Sep. 14, 2010 Sheet 3 of 62 


OO 

to 


o 

to 


lT> 

m 


in 


J^| > {^! 

Id* id 


>P*!> 


r**t 

> i^i 

ZZZ 

0.0 

*y 


-u 2 

eg n 



»-*■ uu va- u. — > ; <_i_ cx.{ i 

[o e> o o o o o o o oj oo[cp o o] < 



</>f> > [in] > fcol 

ofsSsl 
L<-.<iEaIls ■ , - i - 


'(-gif ^ K-gl <=>[•& 


«cl — tul-cl 

JC 2 o _ 2 


E U. O <C *g 

..... <1> *-G9>f< •<! 

EaE3L^lEaid®L^Jli3l2©Si^si 

<Qiuj^Q;£i;j2^ -( CL02a5 

E 


LSI 2 

c y; 

CK > 


U U W _j > > _j > >| — 

Zf — > — » — t [af jz] Oil O 2 !Z 2] 2C 
uj < O o < 2 — iCT<aC — lOO 
^QiOo:a:i<2<cww 

0* 
<c 

Q 


-j jctc| 35 jo3 *— [ocrj .u Oja: a:] «u [ST og c£ ] 

~ [<i > 


N q c| a: iuu lu lu o lu o] a. [oj ►— ■< 
^2^ujujijjq;^>qu.<uju 
ro'T3~~c5 ] as lu o jc jr~o“'o "o) cn ?o jcTo] 


o 


SIS 00 (HI ^ IHISl 3 cEI^> E 3 1 

H^-^Lu*— ><i— .vc^rcoco 

fafl > {or) —• cl — lor crj — (3 < > frc a^l < 

wo^trui-o^Lj I eg q i i 


*o m in 

CM "—a *— ■ 


-c-v. coitiqctj — ct>»— *— m 

• rO " — « — *“0 * — <N CN * — « — 


D 


§ 

to 

v> 

CO 

X 

O 

«■*— 

3 

c 



aJ 

O 


O 

a 

v> 

c 

o 

o 


o 

L_ 

o 




~cz 

o 

o 


<L> 

» — 


CL 

o 

> 

3 

CL 

J= 

LU 

O 

o 

• — 



<n 


T3 

-C 

aa 

l_ 

. » 


1 

♦— 

u 

O) 

a> 

< 

» 


O 

<D 

u 

u 


C/l 

» 

O 

c 

o 

3 

1 


LU 


>> 




( 

o 

- 

CO 



O 



I» 


2 

4 

-C 

o 

o 


o 


2 

CL 



I 

CD 




<u 

.Q 


o 

a. 

OQ 

d> 

1 

o 




n: 

to 

m 

a» 

m 

m 

Ll_ 


<D 

-O 

a 

§? 

CO 

& 

E 

O 

<o 

1 — 


QJ 

E 

L- 


:r 

1 



o 

a> 



In 



in 




<u 


U 

O 

-O 

o 

c 

o 

>s 



o 

z 

LU 

O 


CO 


O =J L*- 1 
OO 3 uj 

& «« s 

. U- 5 

g° £ 

== 2^: LU 

te o tn 

LU 2 O 
Q CD u 


US 7,795,388 B2 



o o 


FIG.2B 



Section 


U.S. Patent 


Sep. 14, 2010 


Sheet 4 of 62 


US 7,795,388 B2 



< C cjjojjc < <cl c 
t — jojg[o] — jopgflo ojjclio o[ fc c c] o 

i 3 i h- i i r> i j-T™ i i 

i z i z i 2 * i i z i 2: z i i 

! U I UJ I bJ I ) I ^ K I ( 

l u» i — i >- 1 1 * — - ( >— >— 1 1 

FclI ^ jol] 3 = j*cL gl ‘ aToll jail ^ [cl ‘ all a_ 
!=cj oj:c| o: m fa j:r i q x!q o x ml n: 


“F 2 " 


i=sj di == PLtse, 

I — » — j > — » > -j — - 


I^OO^ [LU^Ujj UJ 

i u. f > zrzr> >i — i 


a Q o [lZj] o [ujj 3 o |C^ji o [<] a ILu uJj uj 


E 7 

g 

U UJ UJ u uij 
^ ^ ^ ^ ^ 

CO 

ii? 

CO CO 

CO QL.fk 

]<C 


fe 

“’-HI Gl 

;[>j 

ike! 

LfcWit - 1 

B 

.tnJ <=» Ltn-J 


>>2>^>C<>C I 

~cf o'o'o'o o ~o“ eToj i 

C3£X)Q00QQ0 0i i 
52) Z Z iCOjZ Z Z 52) 2 j 1 
’ 1— h- prpr%- K h* hvh-j I 

° >-■- :«e[E3 ) 


> < c 
ro“o“o! < 
(a o of < 
^iz z|: 


y; }h- t-i 


I CL [~] Q- PH Q_ >1 Q- U> I CL gg] ■ 

| «< [ey- <c [o] < (o“o] << jo] 1 cn [o"o] < 

o o 000 o~o o o o c3 - c3""c3j < 

| lu -J U. ]>> 3 t| u- [ 3^ — >- ^ ~ ~ i 
co col ^ a. bg (<cj< on l a (co oo|o a i 
| ^ foj ^ z « ooio jojaz o r o o ji 
I q Lij f> 1 uj :> >|lj!>; 2 *: uj > >!: 

zra: 


>iL=L=ll>li 3 U ' 


uj > j: 

Efcd ; 


OlOlOOOOCOOOOOMSrvJO^^ 

Ki^rotDOin^ioinmto^^ 


— o a <0 

■ — o u O 
-cr o o — 
a o > d 
O O •— o 
*0 -C — * L_ 

■ — o « a> 
o <u o _o 
o c o 3 

>s 


O 3 

I CO ^ 
XII 
o co o 
X tu in 

<U Q_ CD 

n - 0 a 

o u 
to — * 

§ I 

l_ _o 
Q> O 

£ S 

>x 

U 


-c l I I I ] c 

o 

o o o o 00 

‘ r to to £ £ 

<c aa a cl cl 

» o Sfi £ o o 

3 51 o o a! 51 

a u - - u o 

cn 1 — t_ i-» — 1 — 

Z TJ TD 

C C 

— OO 

o jz 

— <j o 

i— 00 

*0 -M 

o i E 


FIG.2C 



Section 


U.S. Patent 


Sep. 14, 2010 


Sheet 5 of 62 


US 7,795,388 B2 


< £3 z G3 z L-=L=3 ^ Ejh£ * (=L.=<j 

fQl ^ O OS OtS OC Sg |q 1 SC fuj UJ UJ UJ ml 
uj — I O — I > —JUJUJOZCuU-OO 

^ U O UJ ZliJ^^UQCUJUJZ Z 


j H— 

o Q^’ 


UJ 

H- < 

sc IsC ac 


sc 

— 


— > 


3**. 

o ^ ^ 


— > 

3~ 

Sl>; 

l&L^T. 

1 

-^ ! S 

rzn — — f-j 

i — j i — 

UJ to to UJ 

13 

__f 

» [< 

o L<j 


UJ 

< 

► O 

o 

to o 

o 

o oc 

CO QC OC < 

c 



paT~<: <: < “< t— [< <T'< <f •<] < 

>x3SteS > 

n>i ►— rvj >— [>] ►— i>i •< *— [>] •— ►— [>_ 5>] > 

tir ^ I — I — I — | — (prj| i—j“o fco|[»^~ j (y) col •— 

SC ]*— 1 — H tSg*] .* 7". _ , *7 7- H- w ~hT* ~ , pr |^> »— 

V^r >— t— t — ►— t— t— * h— t— 7— ~ i — 


OOOOOOC 30 
|Q Q P O ' Q Q Q Q 

!•<! [o~"$rL * ’p"T5~X3~o" ~ e> 

fuj ] *< u. C <c > [p2T”uj 
|lu q} v) To] ^ fo| «< < 
S] z fa] ac fcT~o~‘'Q~Q 

isi < Ls! *- 3 Ro_o 

o sc ►— adtoacci— 
ac co sc to uj sc xc ac 

B> ^ ■<~Qj> > > 

O O SC iUJ UJ UJ 

> iC > QC SC 2 > > 

j-J > 7 g ~ _j > -J ~ ~ 

Pl la al f-^l 


h— h— | — H— H— H- I 

O O o O O O O < 

p Q Q Q O Q Q i 

~'cT'o~~ 15l > [o[]ePo3 ( 
><-<<>> 
Jjj] > fuj Uj uT~UJ ~~lIT| I 
< [S] QC I UJ UJ 1 SC SC < 

~o] z jo] z ac [o o ] i 

.i?l *“ la! *- *- Is:_slS 1 

»— < o Z Z -J ~J 

ac z o z z o cd 

"< 

> f .>iB l>„>jUi^l : 

Ujj o fujl l CD jp] ;UJ Uj] 1 

> ac > o O o o 

■—* >» m — » 

l=lElriIiS3l>_>J 


t-o o> csj oo cm o 

OO N ts h- 0> (N. CO' 


GO O (N ^ 

O oo r-'. cti r-** 


o> 


I 

CO 

CO 

CO 

X 


o 

o 

— - 


c 

IM 


<u 

c 

*•—» 

O 


o 

o 

to 

c 

M 

o 

o 

u 

• — » 

o 

L_ 

o 

u 

D 




J = 

o 

o 


QJ 

e 



* 

cx 

o 

> 

. 3 

CL 


o 

or 

UJ 

o 

o 

■ •k 

L> 



jr 

CO 


“O 

-C 

-*-* 

U 


36 



1 

• — ■ 

o 

a> 

<u 

<c 





o 

<n 

u 

jQ 


i 


CO 

» 

o 

c 

o 

r> 

1 


< 


UJ 






o 


J 

o 

• 

CO 



o 

x; 

l 


Vm 

I — 

o 

as 


£L 

CL 


o 

o 



ag 

Cl. 


o 

• 4 m* 


t 

to 




O 

to 

QJ 

JO 


o 

CL 

to 

i 

i 

O 

a 3 

CL 

£ 



4 -» 

az 

to 

to 

ftJ 

' E 

UO 


<D 


CL 

to 

e 

o 


£ 2 E 

a) 


to 

I ) 

c 


o 

o a 
j= -cr 
CL CL 

o 

O O 



& •? -5 


— ; U} Q- O co — 


g52 

00 3 L- 

£ «/> 1 


oj(oJ[ojo o 


O 


FIG. 2D 





Section 


U.S. Patent 


Sep. 14, 2010 


Sheet 6 of 62 


US 7,795,388 B2 


CO I O 1 t 1 UJ CO I Q 

> 1—11 i :» c j > 
i t i i t i i t { to 

0-a^QLOuQ.W5W 
O > 1— Qd QC ^ > — o: 07 

0 <U<n>-^UJh-l/)Ul 

OiC 0*£0’—>00*£Z 


> O 2 U- -J — 


D ^ f>i >- * — 
*- >• 


ICTli— 

uj[EJ 

E3C 


fcj O 


oo i— is > 
«c <c < 
lial PCI ag 

or O LU LlJ 

Mr Mr 

ui o: UJ Of 

in ^ ~ 


ctr SC | H j oc| « 

rJ, ^ J2z , * 

E2>SI- 

u O < uj ■ 

3 — fo~joc p 



_< «< <T<: « 

^ E l > 1 

v— 01 on cd : 

E3 -P: 

c> o o o '< 

UJ </> Od ) — ( 
— Gd — < 


• > > 1 : 


2 3 O' O' 

> >ESjE 3 : 

o op ‘c^Tol i 
cc qc go on 

QC Od — — 


BBSBHP!- — ii> >i - 

Rafq - RniF'^l = p i > >ff =T=l> 

P- CL Q-] < jo. CL CL CL < O. Q. Q. CL CL ( 

y*u * LrsuJ = brsniia^E. = 

I—* jg >1 «t f>l < Cl >35 —I •< c l— 1 ^-'1 - 

^ KS z E5I«C‘<3 x S5_4a] *= [S“<S1 ^ ^ « 

v:<occcujco-<zidSdOO 

>-<2<a;<Qu<^(/){/3ii:y 

Ps [^i^iRf>iPnr>i crzii r=r~>i[=^r>H>i : 

^ IT. E; *^> <£ to* 00 'S" f 

r-. <NO>r-OKno — 


— o o to 

O L- o 

JC O o — 
d O > 3 

o o — o> 
“o x: -*-» ^ 
o qj a> 
U <D U -O 
O C O =3 
>s 
■ CO 


x — a 

<U C 

C N 4 a 
i_ o — 
Qi G — 
cx a 


w 

I <0 

o&A 1 

I U3 if) 

<u a_ to 

og 8 $ 

E 1- ^ 

0 <U 

to 

1 8 


-C Cl 
CX — O 

— O to 

^i 1 

O CO — 
V) Q O 

§ £ — 
L. l~ 

u ai "o 
a> jc c 
4 = -«-> o 


0000 

r to to r 

a, CL CL. d 

- to to - 
o cc x o 


HI 2 <u 
to 

1 I c 

o 

000 

-C 4= 
d d 


o_ o a a_ cl 

p — — o o 


y ^ ^ 1 ) 

p a s ^ 

Z r -5 2 : 3 

u y o S 

Q CD o 3: 


FIG.2E 



Section 


U.S. Patent 


Sep. 14, 2010 


Sheet 7 of 62 


US 7,795,388 B2 


O 

CNl 


o 

CD 


o 

oo 


QE3ji >• >M>3 CO CK 1> > 

o] cofo ot>l < o1>lo ol > > o 
> — > - > s -j > 2 > > a a|> 


<Q<QKQ-I/)luW^^< 

u. SIS u. 


Qgj c| {c < < 

<c ~ c ~ 

I — I I 3E 
I I 


o 

1 — I 

car o o 

too] > m * Ea 

•< > CO > UJ — * 

> I- < U 1 - W 

c Ui CO U- o o 


^ X g] u. J3T 

eg] c [o^Q O.cT 
oc ^ ic co co ~cT] o 

UJ X « — • 1 *—• ■ — * I 

CL. I t | — . — if 

O I LU O 2 2 2 

*- E5JSD5JL421 — 

O UJ ~ o *-• — o 

> < LkJ »-« U Q ^ 

L o cc - ^ ^ - 


plsf2 

{CO CO CO 

o[52]Olik :*3lU 
2 O 1 tcoj o O 

.ss. 

o o \^c K j 

o o isCL.9?J 


0Z20»— CD<C»— 2 


/A f A > A 


5 < -< <] _j 


'CO to CO CO CO oo 


CO 2 2 CO 

*-f<]2fc <Tj2 
E-Suo, </> r oo j# co 
EL~0 3id — —Uhl 


>- < 
o] o 
o <c 

1 

2 

— o 

o 


00 

2 < 



><«^jtooo;>v— tut— ljluoo 

HJ*= < ic E5 j3 w uj 

I— 00 Ss£ </> UJ O »— CO^Q-Q-Qror 
QQuOt/)^QCil — o* — 1 — OO 

2COOUJCLUUJUJCO_JI — »— i~J 

FH O -< Q>] I |>~ >| Cl ( 3 »— i> >1 uj uj > 

■«f* oo Xo O rs h- if) oo cn cnT cnT 

m O ^ ro lO ro ^ , ti£)in'<‘tD' 4 ''tr s < 


<D 


£ 

<o 

Ol 

</» 

X 

— 

o 

o 



3 

c 

• — 

*— > 

4> 

c 


o 



O 

o 

tn 

c 

Nl 

o 

o 

o 

, _ 

CJ 

u 

o 

L. 

C3 


-O 


JC 

o 

o 


CD 

£ 


• «— 

« 

CL 

o 

> 

3 

CL 


o 

jC 

UJ 

O 

o 


U 



JC 

t/> 


T3 

-c 


L- 


2 

— » 




3 3 


CO 

uj ce 
XD < 
o =2 

CO 2 

& ^ 

SS 


2 

UJ 


8 


1 

CO 

2 

o 


^ 1 

±J <? 
¥ 2 
o 


o 


Q 03 O 3P 2 

f51f5]fp]o ta 


FIG.2F 







Section 


U.S. Patent 


Sep. 14, 2010 


Sheet 9 of 62 


US 7,795,388 B2 


eg ccco— 

to fujj[B]j 

{~j _j — ) —j] >- [ — > — j i Li _j 

z h rsrztt ? 

CCLCtLl— CL-J-J^C 

RQRIEISQR[>3F] 

0 Q 3* O K < CQj > | 

1 OC I O < 2C I 12 2 

t — ( > CL > I l—o 

2 2 o -j 2 >— 2 C 2 to o 

O U00^0000ujyw< 

H—i GEliEl >- 1> > >j ;E3r'J 

lor^ yl > [|Z] to 1^: oc| > [ ^ or * 
q [S£ to £]uj 12~ 
cx-Tofo? Q*jo_ co a. a,]ujla. 

j=§j < si ^ laHS iaal x 

foj — 1 jo] — O — J j< oj — [< 

CL — J CL — ILU— JCLh— — » O 
X — 3T — or — IT 2 — CO 

O >Q.^Q.Q.h>q:q.> 

[uj] to fuj uj| as to uj lu o} | 

idUJlliQLiJUlCiCOU 
G>~JOU.O_JCr}Q__JZ 
— T >1 L^j |mi 

>>■— c — <> — o<c 

— <c — uj — i i_a — :> v*£ >“ 

f515~ol 3 [oj o [o o] O [o] 
>- >— 2 k uj q: of uj i— cl 

Q uj rj uj r uj r>> j sd n> 

CL 1—jJ I— C CL -J — O [_J 
07 02^00>Q^c:uj2^_j 

CNJ IQzE^I — |h— CO CO Jx tO) 


u~lj1 [q" 


«< H- O O 

a. a. < c 

<0r“] 

co co c < 
0 0 1.1 
> > ! I 

to to 2 2 

CO CO > > 


^ ^ a: 

snz “s*' 

UJ UJ CL 

00 to [3_ 

I — J fo 

-j I O r 

> > to 
>- o 

«C <C !> 
o q| > 
o o o 

U-. U. 2 
LU LlJ r=r 

o o -< 

iC ^ >- 

oo(5: 

CO o to 

: -r 

I to _J 


CD CD CO * — lO lD lO ’ — uO tO uO O DO CO O) 
to 1 - O O OJ o ' — — PO*— COtOOOtO 
CN (SJCM<NC\(N(NCNCMCN<Nt-CN<NNCsJ 


a> 

. _ 

§ 

tn 

to 

tn 

X 

— 

o 

o 

— 


C 

. — 

. — 

a> 

c 

-♦— » 

o 

— > 

o 

a 

CO 

c 

JS 1 

o 

o 

o 


u 

l. 

o 

L_ 

p 


jQ 


JC 

o 

o 


a> 

P 

— 

• — 

UJ 

CL 

CJ 

> 

3 

CL 


o 

JO 

o 

o 

*— • 

o 



JC 

tn 

t 

-O 

jc 

•M* 

l_ 

c’ 





* — 

o 

OJ 

<u 


i 


to 

; 

o 

GJ 

o 

JQ 

I 


< 

UJ 

o 

cr 

o 

=> 


o 


l 

o 

, 

>N 

• 

-hJ 

o 

JC 

1 


o 

u 1 

o 


3 

CL 

cu 

r-s 

o 

LO 

CL 

* 

o 

1 



-t— ( 

to 

to 

uO 

JU 

lD 

in 

<u 

2 

CL 

to 

£ 

~Q 


to 

Cl. 

o 


— 

JZ 

to 

cn 

►— 

a> 
- E 

o 


nr 


O 

L_ 



l. 


to 

<u 



QJ 


9 

-k_> 



JC 


E 

u 



—t 



o 





tu 

JO 





JC 

o 






c 






o 





-C CL 
Cl — O 
— o 

O CL 
ni tO 

GJ E X 


OJ 

</3 

I I t 1 I c 

o o o o o o 

_c co to _c _c= 

CL CL CL CL a. 

o «e *e o o 


CL O O CL CL 

u a a 


[o}0[33o o 


o 


FIG.2H 





Section 


U.S. Patent 


Sep. 14, 2010 


Sheet 10 of 62 


US 7,795,388 B2 


in (o][ujj > jug > > [33 i 
& oc o' * ^:| o : 

h- . 

H- fe Sl ol— jo O E ""I - 

oo:<<o>>QCi 

7 E] h- ui [ 5 g <t (*: in^co i 

— • nij“uji _ i ra — j fuj ujit 


53 ^ 53 — * * ** — * [33] — 1 |u3| —4 — j |lU*uj 

UJ I — o ( 5 T]S][ 5 ±*] LU O 
UJ HE] < 2 ] UJ 2 U „ 9 £ ^ EH zb S 

3} >► jgg > lu > [ 3^__3 q* [ 3-31 02 

> 1 U- | > u. U. |-~» ~ 3 3 ) 

y < ^ — ) -j -j uo j— i -jf o j-j s[ o o 

iH-OHJh-UOhliJl-HQO 

[Us] <c *— «c c |co -j a] c 1 -j co on] _j _j 
o pT^ 3 j I-* ~i y o [31 or _ j _j ^ 2 ^ 

H“ 1 <C CO «<j Or[^juJ ol<lui j< <l ui UJ 
O. UI Q- U > LJ - • a C> Q. U UJ CL CL 
QOOi^COOQf— COQOCDt^Q 

2 juj U Ui Q lU| ►— i— |uj Q) 3C (o| 1 — t— 

EadES ta ^ By3>->JE3 

QTOCCO^CDCC^OZOQ^^- 

Fg uj [3] uj u. uj E 3 ^ E3 ^ L>j >1 

u< 2 < 2 <yu(^ 0<<00 

<Q*-B^B>- < [E > Ej ->">1 

q^o*— o~ i— ~-oo 

o |[ 3 “ 3 j u. cl [ 33 ] i— * fZjfjirs > >l f 3 ~ 3 j 

joT oC cl“qT 3 —> | 5 !~al] f~ fal] 3 i i'qI] 2 2 

ej o y o o o o y o < </) k h h 

><^<^<>jo;y>-iyy 


to fuj} >• o > f 31 
°* S? ly hg CKtZ o | 
fZji] > [flj 3 ] o 

(~Vr =■ > -j j 

2 C3 2 o O 

uj [ 3 ] uj las 5T 

53 - 53 

uj i — o{5T]c^3 

UJ [2] C [ 2 ] Ui 

53 > [3 > ^ 

>1 Ljl 
y < y I —) -j 

1 H O L J 

fs1< ^ < < 
opT 23 ]H 
o *“ l^C <n a or 

04 _ Q. UJ £L LJ > 

O O Q 00 

2 juj uj uj Q 


ir5L^t-r^ilS fo r , ^ 
uo y 0 0 o o 2 * o 

><^<t/)<>uct: 

u y u a: y u u w y 

£ £jj < £3 < br EOEE1QI 

CM 00 >CL>y>CLCL< 


1 y y 
coo 

2 [33] . 

C CO CO 


rn *— O f-o cn <T> 00 > ro o O LO O o.to 

o- iD^^ninro^'cj-fvLn^tD^TfrN 
04 <NCN(\IC\<Nr 4 fMCNf 4 CMMr 4 CS|C 4 CN 


QJ 


e 

cn 

cn 

cn 



0 

1 

O 

— 


3 

c 

• — 

•— . 

0 ) 

c 

CO 

•U» 

O 

■— 

O 

0 

tn 

c 

fsi 

Q 

c 

0 

a 

. — ‘ 

O 

L_ 

0 

L_ 

g 


£ 

J 3 


JC 

O 

O 


<v 

£ 

— 

• — 

1 1 1 

CL 

O 

> 

3 

Cl 


O 


JC 


O 

O 


0 



-C 

1 


• — DO) 
_J o Q> CJ 
UJ o c o 
O >? — : 


» — 2 UJ 2 UJ cn 

^ I l I § ^ 

to 

I I I I 1 g 

OOO o 00 

JC UD to £ £ 

Q. CL CL CL CL 

— co tn — — 
o -jz zc o a 


o Cl O 
^ co to 
V X CL 
jQ 00 

— 31 
O O 

H 

O v_ 

co a> 


CL O O Q_ a. 
O - — o o 


o — *0 XJ 

cn a c c 

o 00 

E i— sz jc 

U -O OO 

a> c 00 

JZ O -*-> - 1 







Section 10 


U.S. Patent 


Sep. 14, 2010 


Sheet 11 of 62 


US 7,795,388 B2 





NON-SIMILAR EXA. 



Section 11 


U.S. Patent 


Sep. 14, 2010 


Sheet 12 of 62 


US 7,795,388 B2 


to 


o 

CO 

►o 


o 

8 



O I I iC I- 

I— I I 2C UJ 

to Lu ) to [Pyjj 

[o]CL UJ [O' O) I 
< ^ Q < UJ ^ 
O 



|cn t— to 

:^HSa 

ledr®*'*®] 

C9 ZC S3*— ►— 

pi - px-j <3j 

| > }1 — t— t — 1 

[■<] >• i-aJ—L -sj 

IS_c 

SI 

□ <□ -«Q 

m o Eg-g-ar 

UJ O o O UJ 


^ yl h- kf&T 
o uj o o a: o 


feLfi3 -■ £03 3 «=*Ts JSLSJ3 

(/)Ut/lQ^(/)OOOOCOuTXKa: 

icc bd ~u [2] cu < SIS 


SI -4 

SI *- 

CTTj 9 

<< <c 

Pi® 

S3 <=» 


^L-i 36 

s njjHgi <=> 

H •- R 


> 

cr 

ice: ad uj [ogj UJ uj [oejaij tt: 


Se ' j2j a S) o ° 

> >^o S£> 

Q£ Qij® j2] Q O jcc 


>1 

B£| 


l> —I > 13 u- f_i :> S —I > _I _J > >1 

B3 ^ [<3 ^ Gg Gs3 1 Z 1 o jz z [<Jl3 

I— » dfei >- C£ t — I — I <1 U. f3E 3= — I — I 3E 3S| 


-r- r^r^< 7 >a 3 LOtr>roo>i^cO‘r- cm to to 

•M- *—■ O O O CM O «— O tO ♦— 0 O r<> O O 

ro Kj^rjKifOfOto^nKicMfOfOton 





in 







<L> 

■ -— 

i 

3 


LO 

- x • — 

o 


y 

O 

— 

U 

in 

, w 

Q> 

c 

tn 

3 

N-l 

O 

—— 

o 

c 

00 

r~ tsj 

o 

c 


o 

o 

* — 

o 

o 

o 

<5 g 

♦ — 

UJ 

5 

n 


-C 

o 

u 


— 

>- 


» — 

* ■ 

Cl 

o 

o 

=5 

Q- 

D 



JC 

UJ 

O 

JC 

> 

u 


JO 

1 

i 

CO 


“O 

o 

. — 

u 

■ 2 

w 




i 

• “ 

QJ 


V 

c 


o 

o 



O 

c 

a? 

jC 

) 

♦ 

JC 

CO 

co 

-J 

o 

>N 

o 

o 

\ 

< 

CL 


UJ 


in 

o 


o 



(A 

1 

o 

• 




O -C 

1 

o 



t_ 

H" 

o 

• 


jr o- 




o 

o 


SP 

2 

2 

cl — 

O 

r<* 


-*— » 


1 

a. 



— o 

to 

a. 

o 

<L» 



CO 


1 

o 

O- 

o 


JD 


O 

zn 

1 




t— 

L_ 



a q_ 


& 


lO 

in 


a> — > co 

~ .5 &5 

<1> U 3“ 

£ a> 


to 

CO 

a. 


c e 

o 

in 


«e s i .2 


CO 


o 

o 

_o 

o 

c 

o 


V. 

a> 

jC 


X) *o 
c c 
o o 
-C jo 
o o 

o o 


e s 


04 

o 



to ^ 

UJ c/) 

or 



CJ 


G WEAKLY SIMILAR EXA. 
G NON-SIMILAR EXA. 



Section 12 

(375) 375 380 390 408 

1 i 

TF55 beto - S. shibotoe (349) - D[i.TjY[A}A L [v] E E R®V G E(d||K|®V ^ 0 ^ (^1 a IE! n p 


U.S. Patent 


Sep. 14, 2010 


Sheet 13 of 62 


US 7,795,388 B2 


o a. o sc e> cl to o x o 
R1 a* m o [<] z ci [to] ^ 

-I [52] -4 (gj -4 g gjg] -4 to 
^ o <c O 2k: H- 

a 


< O <] I |< o o < 


> I— 

ce Q>) 

UJ u. 

RH 

>■ 


i: 


uj o 


« ED os i— >1 “ (3 

bJU.UI L.U.UJ-J 

O > ‘O > I— O . 

.. - 0 

z 


UJUJ Ql C/5 


LU Q 


CL UJ 
Q O 
>* > 
QfSl 

LO I> 

«— o 

C UJ 

^[>j 


to C 

UJ *— O 
uj to > 


or lj£j 

O *— 

“ r<5i 

BLi 
o o 


^ O K UJ O ^ Q 

Ct Z O O Q </) 

>- >->>>- L. 

to [ 2 ] Q 0 O D@ 

to UJ to CC 

UJ — J O UJ UJ 

f— sc to uj 
U iQ 2 [> 

[u 3 ] to [uj] — j 
— <J? — 
o [<] uj Qc 
a: ac o >- 
Q£ [O] G£ [o~o] Z fol 

bUbl^ 


LU O 


1< _<] o 

to _| 


jQlrdJ 


;> 

ac 

o 

o 


o o 

Q CJ 

■> 


o 

c 


D O >- 
UJ to 

o to 


ce 


CL | | 

a £ 

UJ ) 

CL 

< 

1 

O I f 

UJ 

1 1 

L 


l 

C 1 1 

UJ 

1 l 

UJ 

> 

1 

— 1 1 


1 1 


c 


< 1 1 

o 

u_ 1 

X 

CT> 

rn 

rO 

(356) 

(339) 

(345) 

(341) 

•o 

5? c 

to tO 

to 

to 

fO 


Q 

QC 

QC 


C 

33 < 

1 

TST 

“521 

X 

1 — 

i— 

a 

z 

z < 

-j 



OC 

13 

3 - 

UJ 


) 

z 

— 

— . 


-J 

-3 -j 

Isd 

UJ 

UJ SC 

!uj 

o 

-51 O 


Q 

o 

UJ 

o 

o 

>• 

— 

— 

Ul 

Ul 


t/> 

LU 

UJ 

4— 

o 

O 

h~ 

> 

> 


f>" 

‘ >*] > 

[oj 

lidL 

LU] UJ 

-J 

LU 

UJ 

o 

SI 

12 < 

UJ 

o 

o 

►— 

[o'‘ 

”o] o 

[HZ J brdL 

J j \ 


Cr c u- 

— ' •< :> 


un 

r*-. 


CS| 

e> 



& 

to 

to 

to 

X 


o 

— — ■ 

z> 

3 

c 

• — 

• — 

05 

c: 

o 

— 

a 

o 

to 

c 

hN< 

o 

u 


CJ 

u 

o 

u 


••• 


_c 

o 

o 


a> 

— * 

• 

CL 

CJ 

> 

3 

CL 


o 

LU 

O 

o 

• M. 

CJ 



-C 

1 

x> 

XI 


u 


2 S 


‘ “ 

u 

05 

05 

<c 




u 

tv 

o 

X 5 


! 


i 

o 

c 

o 

3 

1 


c* 

UJ 


>v 




a 


o 

» 

to 



o 

XI 

1 

u 

t— 


s 

■ * 

x: 

CL 


e> 


o 


3 £ 

Cl 


o 

1 

to 



— 

O 

to 



o o o 

j=: to to 

CL CL CL _ __ 

O Sfi *£ O O 


_o — 
o 

G *u 

0 a> 
to — * 

1 ^ 

4 — X> 

<u o 
-C c 
— • o 


I 

to m 
to 

CL 
CO 

x 


8s 


05 

J= 


O 

to — 
§ .2 
U U 
05 X> 

jr c 
w O 

x: 

o 

o 


cl a a cl cl 

o — — O o 

1 — L L L— L— 

-o “O 
c c 

o o 

_c x: 
o o 

o o 


e e 


UJ 

o 



E30[i] O O 


u 



Section 13 


U.S. Patent 


Sep. 14, 2010 


Sheet 14 of 62 


US 7,795,388 B2 



rgj.Ufl SC {lu] Q£ fu j LU U u Lj”u r uj] UJ 
- sis Ij :> > _ j] _j 

> <c 

: (Ji d] li_ f<] 3 C >* |<] a: |<c <c | a o; c 

:f <. Iscl < Isc p[ o ^ oz oz sc scl ^ 

:>< 2 zo><><<>> 

irEZZOZZCCCZGOZSCOO 

rm^nr —I "zrzji 

: Rolfd « < cfj^jlc < c <i c 

o Ql o 

JI-UCJUJ^liJI-Xl-KII 


Id oc lento: Id a: 


Id eg |co( QZ CZ. |cO CD) 


foci < joe j -J UJ ac p~ OC] Q fSSJ Q Q [OC~Q?j 1 

y ^ kJ SC 53 [ujjjjj sc jujj o sc [ijU_ LjJj i 

H^2a>£E<-l^2JCi£2S 


>UJ>UJOUJ^>LJ(/)UUJ 

CC>XK>->2X<V<> 

E5IIQ”5?53 ^ ftznCT*^i nzii c± cs 

EE EE ED > B Sg ) z itn tn[ 


!fi£l o joe] o @3 o [oc_ccj r o [acf o oltg.ee, a: 
R >. >, >» H < R RH -~j z> > — |~3] _J 
U sc U se U SC —lU Rfsc sc R — 1 _J 
IS RR-RR S £3 EI3 5SI53 •— 

G5I>[23» ED> [h- coi >» [to] >» >• ico col to 

>£3 > K3C2J :> h- E^y GS3 l^2] < 
EZLpi «< -pTjuRl 01 •“ pi |> !>| to to > 

sc lol iC |o -C o| SC SC |o] tO |o o| I— f— O 

<o to oo* ro K) kT CnT ocT 'in ^ cJ S' crT 

r^r^tor^-oor^r-^LOOoo*— or-r^o 

00000000^00^00^ 


03 


E 

cn 

CO 

<o 

X — 

o 

3— 

2 

O 


5 

3 

<z 


03 

c 

to 



o 

— 

<J 

o 

to 

C M 

o 

c 


o 

o 

« — 

O 

u 

-2 

L- O 


LjJ 


jCt 


JZ 

o 

o 


03 £ 

— » 

>- 

zc 

••* 

• 

CL 

o 

> 

Z3 

Cl 

o 



XT 

UJ 

O 

o 

• Mm 

L> 

• 

-C 

J 

1 

to 


■a 

JZ 

03 

L. 

* 2 

M-* 




1 

• •— 

o 

mJ 

03 



o 

o 

♦ 


o 

03 

03 

JZ 



JC 

CD 

CO 

1 

o 

cz 

O 

• =3 


< 

CL CL 


LlJ 


>N 

O 


a 


— 

tQ 


i 

° Si T 1 

— ' IT O LO 
a> to to 
-o — a_ cl 
a to to 
ai *- x: X 
E <— 

0 03 
ca -*-* 

1 8 

i- jO 
03 O 

-c c: 

— •* a 


CL O O CL Q- 

O — — O O 

H- V. i» H- •— 

“O no 
c c 
o o 


FIG.2M 



Section 14 


U.S. Patent 


Sep. 14, 2010 


Sheet 15 of 62 


US 7,795,388 B2 


j«t fo|«cjfo1 <j[o <[|o «7< < 

j— i > |— ij •— — * •— j— i — ij — — ii — —I — < — j 


jo Izlioi jo! < to o 2 * io O 4 

tujj o O [ujJ QC [uj] o|uj] O [ujj X ol uj L ijj t 

(S] Q (g) iijglQ fSr^ Qf^]Q Q (OC QC] < 
jo] UJ [o‘o o o“ o'o] U. w UJ u coco < 
03C0C/l»"-'UJ0*£Z02E2£0t9 
g|oF3]op > si: 

Q- I o <C <C 1 CC < I H- I { 1 — t— 

[<s I El*- IS I [<33 I [<}^ a- E2T 

(3 ^ EE) < 0 ^ EHI ^ o- >-i : 

lCJt/l5-J-JLIUJZJ_lZ2 

ococcLjaLjj^jji^cit^wuu 

a<S^&]uiS^53 - 

oc [31 oc V^j 3E^3] eg [3] >> [3 3] v- >- . 

p T cn <[ a. pc] cl pFTo) ac z cl cl [co co| ■ 

|-~>1 <c 1—1 c [ — i[ < — i — i c (_j -u t _j . 

W>U-JUI<UU<<00<< 
-a:<iu.o>^>*<a:Q:<< 
53 *— <c •< jut] _j fuj~tU t [qj _j _j grujj l 

|> ^ ^ >|Q_ :> — > > 1 . 


[o~o T5”o‘ - c5 _ o^o" o™"o~ e> 35" <333 c 
! o o o o o o o o o . o o o o ol < 
K o <T ol z jo ol >> [oj cl ac jo] a_ cl < 

[>]«<«< a_ [>] < g>3>il CL f>l pr^j r^>] : 

< (Ei >- ebeeh >:>:>>] : 

OC > «C ~ UJ — 2* — Z — — .LOCO 


°23^^ Nl ^ toCN|0l ^toxtinin 

— 000«— OOOfO^—^FfOOO 


E to in 
3 z> c 

— 0 0 

— <j l. 

-e o o 

Q. O > 

00 — 

"O -CZ <u 

— o — — 1 
o d) o 
o c o 

>v a 


iS 3 

° &5 T 

^IO 
a> to 
■JS — Cl 
O CO 

<y - rc 


x — o 
— a > c 
C is» O 
u. o — 
<c E — 
o. a 

c- 35 


D 

o _c: | 

-C CL 
OL — O 

— O to 
o CL 
<D to 


LUX) 

o> j r c 
-c — > o 


V) < < 7 ) 3 
3 X 2 £ 

I I I c 

o 

0000 
to -C jC 

85 35 

a: o o 

o 5Z 5Z 

*-00 


vi ^ uj ? 

ii 




L> 


FIG.2N 




Section 15 


U.S. Patent 


Sep. 14, 2010 


Sheet 16 of 62 


US 7,795,388 B2 


O 

in 


O 

O 

tn 


o 

o> 




ijxl ojxjo. (3 Q- [jx] ojjrjco V) BTx ] : 

IOU1 Jto J JOO>*Vl^li.U. 



> > > 

- - 

]o|> fol > |uj o( > ]uj! -j" oj 

O -j •— < tn t/1 LO K H 

ao.ouJH<Q.ow>aww 


ED 1 




lsl*-*RiS^(Cf 

&1 

UJ LU 

SJ3 J 

t j j 2 -J 3 -i > 

SC 

> > 

o a 

0 o o’o‘o’o"3i 

1 - _{ 7—^1 _ . ' 

|® 

“o 3 

J53 


I2£ 


-j u u tr 

.resn 


— * w—«- ’ 

U «o iSLSL 

L. LU LlJ GO 


|o 

i < 


:» se 


,z 


z z j 

:> :> 


••< c < <*<<: < c •< -c <ci < 

y Ejuausu.- a - «=_=. - -* . 

O Q£ — ■ O >* jh— h* fr— » fr— » t*— l-H > 

2aai<:>^i~o>- is 3 z z 

— — i [cl] — » [cu ~j [aTaJ! >- [cl] ■< c jaT'al] < 

SM * ' ! ' ’ ' ' 


i L ttJ cl i— 

o <C UJ c SC < 

| uj uj uj] > — fuT uTUj 
S ~j pj — i — i — ; 
<f~-< <x: c <• < 


-j a. L~i cx 


t— > t— — 





<c 


: o; o o: uj > uj u 2 

r<i — * ^ [<r<i o f<] ^ ^ r< j<] 

c nn > L{ ■ — . ^ ~6 ±[j3 

<c > < — c ^ u u 

UJitfLUOOZUUl— O < uj u < < 


»- l o»-^ooou5r-a)oou3a)Oir' 


0 ) 


i 

<A 

tfl 

to 

X 


O 

o 


zs 

c 



a> 

c 


o 

— — 

CJ 

o 

CO 

C 

rs 

o 

o 

CJ 

• ■M 

o 

u 

o 

u 

i 


-O 


x; 

o 

o 


V 




cx 

o 

> 

3 

Ou 


a 

IE 

UJ 

o 

o 

" — 

o 


« 

JZ 

<o 


■a 

-C 

t * 

u. 

• 


MP-» 


1 

• MM 


V 

0 ) 

<• 





o 

4 ) 

o 

-Q 


1 

• 

to 

_j 

a 

c 

o 

3 

1 


c 




00 

[sgloflolo o 


O 


FIG. 20 



U.S. Patent 


Sep. 14, 2010 


Sheet 17 of 62 


US 7,795,388 B2 


to 

*— 

tr> 



_j 3 x c lu <c 

o i*s t— a. <c 

3fE3u. op 

|3 S 3 

iof£*]o&To cf 
ucu>»-uj' 
o_ — u- > >- 

2 U UJ O > 

1 ! 1 1 02 L 

ioluj folse cnfo 


lk<kWOO-J>* 

Q<B<B<E> 

z z z [S] cc z z z 

Q >- <E>3 


it UiJI WUOO 

tr _> I ** ^ > 

3~s"3}ujpg" 3E jz^jZ 3E 

o o_o v HTsc aclojojci 

uTuj a: cr 

LU *— Li- Lt- O. CL- 


UJ *S O 
| I I 
“o o] os 
i* o; o 
< ~ 3 

I- <E1< 

fa a o 

f?T3^ 


Q it ^ 

J j I 

o o 61 o 
-J 2 : X 
as tn > 

< 15151 ^ 

o’ Q al a 

>■ iPPI ^ 


OO^O^iT!--^OinrOv-(JirOrO^ 
r^r-*p^r**oor-r-Nf~*.oco*~o>r^r^ — 


S en to 
3 c 
— u a 
— o »- 
-c 00 
Q. O > 
© © — 
-ox:-*-* 
*— o 0> 
o a> o 
o c o 

>N 

. cn 


;g2 

^ x to 
at cl. 

-O — CO 

a x 

I w 

O aj 


x •- O 
*— <LI C 
C N O 


1 o ^ 

£ ± ' 
CL — O 

— O to 
o a, 

g 1» 

lie 

C <t> *o 
© x: c 
-C H-* o 
— ' jC 

o 

o 


1 f { g 

0000 

CD X X 

8s -2-3 


0 0-0- 

•- o o 

t_ ►— I — 
“O 


Q_ 

CVI 

O 




Section 17 


U.S. Patent 


Sep. 14, 2010 


Sheet 18 of 62 


US 7,795,388 B2 


OO 

io 



cd— j ^ < k 

<QI-O^Q<W 

bfolafoWSa 

pS ^Ujl Q tLijf o tu O o 

c*"-** ! t— “ *"—» ! V—- ! w -* t— ' •— • 

pi! t— C£ [t— j Qd b~ gf cr 


Q- CL CL O. CL Q.j f 

IIT x x 

Cl > LiJ Ld -I -J 

> J— 1— I— ^ XI 

EFS^EF^l- 

25 I- J -wJ <— ■ J 

C H > <1q ol < 

•“ hR^h 7 

k— jo:* )— j— ioc qc; c 


>3 — >>Um— •> 
_J _J 3 3(1* -J To) 3 

»— C D — * -< XT o >-* —> 

:> [«c < <_< < n c < 

f<]:>l^r~ h — 1-< < 

juj] c/> LjJ co lj co uj^uj 

E— < F~ «< ^77 
g< *c /golfer c •< 

<f” >>"Lo ."z < z CO c 
bdOUO^O^Z 

EFBRERH^ 

ft < ■< < , *< <1 >r< 

z — i co o o 

3 |cc^*: ac x: ac xc| h- 

o ^ {s ra 

L> > > > > > >8^1 

— j |xc oc xsj fr— f^l — j jac 
c *— ~ c -< ;> > _j 
jaT^SIS-I] </> t§7flT gT 

|uJ Q uj Q O Q LxJ tZT 
1~ -J — >•[>- «< >-l> 

P^f5jMF^r>2 

jo O O OJ r|o[ag 3C: 

g CNJ — CN^f<7)tO^ 

o> oi <n 6 oo o> o> 


CO > c o CO N— — 

Io| o pC *< < -cT < 

>|<1> >]c < c 

CO(_Lijjo o Ui UJ UJ 

c 7_7t *“ 

<3" 7 <.■?!? < 

D to Q Q U- L 

> — > — i ^ ^ 

pj ~J — J I j — 1 

<|uojc < fco~co[ <c 
h- x: h-~ V- x: ^ 
Cg]> fog Qc j > > QC 
— [xcj :> >Jx: icjx 
[§3 CO U. [> > > j > 
xT]*— jsg xT) -^ —* M 

— » H- 1 — H* ►— H— 

qT~q 3<— [oTliraj CL 

o UJ CD Q UJ uj|q 


fcFfe=r“>]: 

IO O O CD C D o} < 
V) co o) m <\ i 

CNt CVJ -r- O O - 

to to ^ in in in i 


a» 


! 

«o 

CO 

V5 

X 

•— 

o 

o 

— - 

3 

c 

i — 

• — 

<u 

c 


o 

*— • 

O 

a 

to 

C 

rM 

o 

a 

u 

*— 

o 

L- 

O 

L. 

O 


JO 


-C 

o 

o 


aj 

e 

Mb 

•— 

* 

cx 

o 

> 

3 

cx 


o 


JC UJ o o — 

co -ox:—* 

f •— O <D 

o <u o 
co — I o c: o 

Li_l >s 

J O * <o 


5- X 3 X 3 <u 

M 

I I I I I C 

o o o o o (3 

jt co co jc -c 

CL Q_ CL CL CL 

O x X o D 


.2 e s 

<u cl 

-O — CO 
O X 
a* — 

e «- 

O 4> 


& O O CL Q_ 

— - O <J 

H L U h* H— 

*o “O 

c c 

o o 

-5 -5 

o o 


FIG.2Q 



Section 18 


U.S. Patent 


Sep. 14, 2010 


Sheet 19 of 62 


US 7,795,388 B2 


909 

( 

l 

2 

2 

i 

1 

J 

1 

1 1 
1 1 

l 

i 

1 

1 

1 

1 

l 

1 

1 

1 

1 

1 

1 

1 

! 

1 



1 

CD 

J 

} 

1 J 

l 

1 

1 

i 

1 

1 

\ 

1 



1 

CD 

i 

J 

) 1 

Lx 

l 

1 

1 

1 

1 

1 

] 



1 

2 

> 

i 

1 1 

UJ 

1 

1 

1 

1 

I 

1 

I 



I 

CD 

i 

>-~ 

I Lx 

Lx 

1 

>- 

1 

Lx 

— J 

1 

1 



l 

CD 


Q 

J CD 



Q 

1 

2 

2 

CD 

CD 



1 

2 

» 

lx. 

( 2 

CO 

1 

2 

1 

O 

00 

2C 

CD 



l 

CD 

i 

CD 

1 CD 

o 

1 

O 

1 

O 

CD 

) 

-J 



OjoiO |o 

. . 1 , , !-ri 
CU |2 j Lxl 1 2 j 

1 JO O 

i iaslo 


Foi 

ki 

1 

l 

jO cdi 

R 3 I 

E 

CD 

HI 

CD 

CD 

2 


(O 

CD 

CO 

CD 

t CD 

UJ 

2 

o 

I 

CD 

CD 

CO 

CO 



CO 

CD 

CO 

CD 

1 O 

UJ 

o 

o 

l 

CD 

CD 

DC 

DC 



be 

c- 

UJ 

CD 

1 O 

UJ 

UJ 

2 

I 

CD 

CD 

D> 

> 



Lxl 

< 

CO 

C 

1 O 

UJ 

2 

O 

I 

2 

2 

< 

<c 



UJ 

CD 

CO 

CD 

!. Ol 

UJ 

o 

< 

1 

O 

CD 

CD 

z: 




i 

CO 

< 

1 > 

UJ 

o 

o 

l 

Q 

CD 

UJ 

UJ 


O 

LjJ 

CD 

CD 

Q_ 

1 CO 

o 

o 

<c 

l 

2 

2 

>- 

>- 


?) 


El 

CO 

Z] 

i. S] 

►— 

CL 

53 

rn 

El 

<C 

CO 


<C 



< 



I 

DC 

2 

o 

cd 

o 

CD 

CD 

o 



o 

o 

Q_ 

UJ 

1 Lx) 

o 

O 

CO 

zn 

2 

2 

nz 

nz 



o 

z 

O- 


i DZ 

DC 

CD 

UJ 

OL 

CD 

CD 

DC 




Q_ 

1 

z 

) 

i i 


CL 

CO 

CD 

Q_ 

CL 

CD 

CD 



0 

1 

CO 

1 

} i luT 

C 

{ 1 

EE 

to - 

Q 

lET 

J=D 

CD 


CO 

1 

CO 

1 

i \ 


C 

1 

DC 

DC 

DC 

DZ 




o 

1 

CO 

1 

i i 

DC 

QC 

UJ 

Q_ 

UJ 

LJ 

1 

1 


0> 

CD 

1 

CO 

CL 

i i 


O 

cu 

Q_ 

UJ 

UJ 

co 

CO 


i ^ 

UO 



CO 

UJ 

( UJ 

CL 

CD 


UJ 

DC 

DC 

UJ 

UJ 


cn 

r^ 

UO 

fO 

to 

CO 

Ovl 

to 

to 

CM 

to 

CD CO rO 
CM tO CM 
UO UO UO 

o' 

*0 

uO 

to 

CM 

O 

uo 

to 

cT 

UO 

UO 

to' 

to 

UO 

UO 

tzr 

to 

UO 

CO 

to 

UO 

(579] 


QH 

04 

O 


0> 

• — 


cn 

cn 

cn 

X 

— 

O 

1 — 


Lx 

1 ZZ 

UJ 

cn 

o 

— 

s 

25 

c 

*— 

<«■ 

CD 

C 

CO 


cn 

» 

CO 

3 


o 

— 

O 

0 

cn 

c 

tM 

O 

-< 


=z 

i 

ZD 

cn 

o 

o 

««— 

O 

X 

O 

X 

O 


UJ 

5 

s 

» ZD 

S 

c: 

_o 


JC 

O 

0 


CD 

£ 


>- 

zc 

■ zn 

CD 

• — 


CL 

O 

> 

0 

CL 


O 






cn 

-C 

UJ 

O 

O 

. — 

0 



JC 

t 

i 

} 

1 

1 

c 

00 


-O 

jd 

f ^ 

X 

V 








0 


1 

* — . 

0 

<u 

CD 

< 



0 

0 

0 

1 0 

O 

CD 

CO 


O 

o> 

0 

_Q 

1 


JI 

to 

<0 -O 

jC 

» 

O 

c 

0 

3 

i 


<C 

CL CL 

Q_ 

CL 

CL 



UJ 


>■* 


xJ 


0 


CO 

CO — 

— 


1 

o 


cn 

• 


0 

-C 

1 

O 

3Z 

ZE 

O 

O 



X 

1 — 


2 


JC 

CL 








o 

CD 


O 


2 

CL 

— 

O 

T— 

— 


• — 

*— 



1 

co 



— 

O 

to 

CL 

O 

a 

CL 

CL 


CD 



CL 

1 

1 

O 


CL 

CD 

• — 

* — 

O 

O 


UD 


O 

CO 

O 



<U 

CO 


X 

X 

1 — 

1 — 




x 

zc 

to 

UO 

o> 

E 

DZ 


T5 

*0 



UO 


tD 


CL 

to 

e 

0 



C 

C 




UO 


O 

— 

CO 

CL 

0 

tn 

— 


O 

O 




Lx 



0 

nr 

CO 

cn 

O 

a 


_c 

_c 




1 — 


O 



ZC 

Q 

E 

— 


0 

0 





P 

X 



E 

X 

X 


0 

a 

r 





5 

O 



X 

CD 

T3 


-t— J 







LO 

-w 



<D 

-C 

cr 


• — 

• — 






Q 

O 



U= 


O 


E 

E 






P 

O 



-X 


XT 









X 

JD 





(J 









0 

O 





0 









JC 

C 















O 





« 










> 





E 










u 













S 3$ 


r> 

Cl 


5 5$ S £ 

^ 3 ^ ^ 2 

W 2 y 5 

=; fe | " i 

S ~ S >J on 

s S-sssg 

5 0 CD O ? 2 

§ r^if^i rsio o 





U.S. Patent 


Sep. 14, 2010 


Sheet 21 of 62 


US 7,795,388 B2 



FIG. 4 (Prior Art) 
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FIG.5C 
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FIG.6A 



FIG.6B 
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FIG.7D 



FIG.7E 
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FIG. 12 
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150 nm 


FIG. 13 
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ATGGCAataaatgatactgacgtacttaggaaagtagcattaacatcctt 

aggcagtaaggcagtagcaggcgcacgagagtatttagctgaccttgtgg 

ttaaagcagtggcacaagtagcagaattaagaggagataagtggtatgtt 

gatctagataatgtacaaatagttaaaaaacatggtggtagcattaatga 

tactcaattagtatacggcatagtagttgataaggaagttgtacatccgg 

gcatgccaaagaggattgaaaatgctaagatagcccttttagacgcttca 

ttagaagttgagaaacccgaattggatgcagaaataagaattaacgatcc 

aacacagatgcacaaattcttggaagaagaagaaaacatattgaaagaaa 

aagtagataagattgcagctactggtgctaacgttgtaatagcgcagaaa 

ggtatcgatgaagttgcacaacactatttagctaagaaaggtatattagc 

tgttaggagagccaagaagagtgatttagagaaattagctagagctaccg 

gaggtagagtcatatcaaatattgatgaattaacttcacaagatctaggt 

tatgccgcattagtggaagagagaaaagtaggtgaagacaagatcgtatt 

cgtagaaggtgcaaagaatccaaaatcagttagtatactaataagaggag 

gattagagagagtagtagatgagactgaaagagctcttagggacgcttta 

ggtacagtggcagatgtaataagggatggtagagcagtagctggtggtgg 

agctgttgagatagagatagctaagagattaagaaagtatgccccacaag 

ttggtggtaaagagcaattagcaattgaagcatatgctaatgcaatagag 

ggtctcattatgatattggcggaaaacgcaggattagatcctatagacaa 

attaatgcaattaagaagtcttcacgagaatgagaccaataaatggtatg 

gacttaatttatttactggaaatccagaggatatgtggaaattaggtgtt 

attgaaccggcactagtgaaaatgaatgcaattaaggctgcaacagaagc 

agtaacattagtgttaagaatagatgatattgtaGGTGGTTCTGGTGGTA 

CCatacctgtaataattttaaaagagggatcaagtagaacatatggaaaa 

gaagctttaagggctaatattgctgcagtgaaagcaattgaagaggcatt 

aaaaagcacctatggtccacgtggaatggataagattcttgttgatagct 

taggagatattacaataacaaatgatggagccactattcttgataaaatg 

gatttacaacacccaacaggtaagcttttagttcagatagctaaaggaca 

agacgaggaaacagctgatggcactaaaactgctgtaattcttgctggag 

aattagctaaaaaagcagaagatcttttatataaggagattcacccaaca 

ataattgtaagcggatataagaaggcagaagaaattgcattaaagaccat 

ccaagatatagcacaaccggtcagc 


FIGURE 2 1A 
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MAINDTDVLRKVALTSLGSKAVAGAREYLADLWKAVAQVAELRGDKWYV 
DLDNVQI VKKHGGS I NDTQLVYGI WDKE WH PGMPKR I ENAKI ALLDAS 
LEVEKPELDAEIRINDPTQMHKFLEEEENILKEKVDKIAATGANWIAQK 
GIDEVAQHYLAKKGILAVRRAKKSDLEKLARATGGRVISNIDELTSQDLG 
YAALVEERKVGE DKI VFVEGAKNPKSVS I L I RGGLERWDETERALRDAL 
GTVADVI RDGRAVAGGGAVE I E I AKRLRKYAPQVGGKEQLAI EAYANAIE 
GLIMILAENAGLDPIDKLMQLRSLHENETNKWYGLNLFTGNPEDMWKLGV 
IEPALVKMNAIKAATEAVTLVLRIDDIVGGSGGTI PVI ILKEGSSRTYGK 
EALRAN I AAVKAI E EALKS T Y G PRGMDK I LVDS LGD I T I TNDGAT I LDKM 
DLQHPTGKLLVQIAKGQDEETADGTKTAVILAGELAKKAEDLLYKEIHPT 
IIVSGYKKAEEIALKTIQDIAQPVS 


FIGURE 2 IB 
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ATGGCAgatccaacacagatgcacaaattcttggaagaagaagaaaacat 

attgaaagaaaaagtagataagattgcagctactggtgctaacgttgtaa 

tagcgcagaaaggtatcgatgaagttgcacaacactatttagctaagaaa 

ggtatattagctgttaggagagccaagaagagtgatttagagaaattagc 

tagagctaccggaggtagagtcatatcaaatattgatgaattaacttcac 

aagatctaggttatgccgcattagtggaagagagaaaagtaggtgaagac 

aagatcgtattcgtagaaggtgcaaagaatccaaaatcagttagtatact 

aataagaggaggattagagagagtagtagatgagactgaaagagctctta 

gggacgctttaggtacagtggcagatgtaataagggatggtagagcagta 

gctggtggtggagctgttgagatagagatagctaagagattaagaaagta 

tgccccacaagttggtggtaaagagcaattagcaattgaagcatatgcta 

atgcaatagagggtctcattatgatattggcggaaaacgcaggattagat 

cctatagacaaattaatgcaattaagaagtcttcacgagaatgagaccaa 

taaatggtatggacttaatttatttactggaaatccagaggatatgtgga 

aattaggtgttattgaaccggcactagtgaaaatgaatgcaattaaggct 

gcaacagaagcagtaacattagtgttaagaatagatgatattgtaGGTGG 

TTCTGGTGGTACCatacctgtaataattttaaaagagggatcaagtagaa 

catatggaaaagaagctttaagggctaatattgctgcagtgaaagcaatt 

gaagaggcattaaaaagcacctatggtccacgtggaatggataagattct 

tgttgatagcttaggagatattacaataacaaatgatggagccactattc 

ttgataaaatggatttacaacacccaacaggtaagcttttagttcagata 

gctaaaggacaagacgaggaaacagctgatggcactaaaactgctgtaat 

tcttgctggagaattagctaaaaaagcagaagatcttttatataaggaga 

ttcacccaacaataattgtaagcggatataagaaggcagaagaaattgca 

ttaaagaccatccaagatatagcacaaccggtcagcataaatgatactga 

cgtacttaggaaagtagcattaacatccttaggcagtaaggcagtagcag 

gcgcacgagagtatttagctgaccttgtggttaaagcagtggcacaagta 

gcagaattaagaggagataagtggtatgttgatctagataatgtacaaat 

agttaaaaaacatggtggtagcattaatgatactcaattagtatacggca 

tagtagttgataaggaagttgtacatccgggcatgccaaagaggattgaa 

aatgctaagatagcccttttagacgcttcattagaagttgagaaacccga 

attggatgcagaaataagaattaac 


FIGURE 22A 
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MADPTQMHKFLEEEENI LKEKVDKIAATGANWI AQKGI DEVAQH YLAKK 
GILAVRRAKKSDLEKLARATGGRVISNIDELTSQDLGYAALVEERECVGED 
KI VFVEGAKN PKSVS I L I RGGLERWDETERALRDALGTVADVI RDGRAV 
AGGGAVE I E IAKRLRKYAPQVGGKEQLAI EAYANAI EGLIMI LAENAGLD 
PIDKLMQLRSLHENETNKWYGLNLFTGNPEDMWKLGVIEPALVKMNAIKA 
ATEAVTLVLRI DDIVGGSGGT I PVI I LKEGSSRTYGKEALRANIAAVKAI 
EEALKST YGPRGMDKI LVDS LGD I T I TNDGAT I LDKMDLQH PTGKLLVQI 
AKGQDEETADGTKTAVI LAGE LAKKAE DLLYKE I H PT 1 1 VSGYKKAEE I A 
LKT I QD I AQ PVS INDT DVLRKVALT S LGSKAVAGARE YLADLWKAVAQV 
AELRGDKWYVDLDNVQI VKKHGGS I NDTQLVYGI WDKEWH PGMPKR I E 
NAKI ALLDAS LEVE KPE LDAE I R I N 


FIGURE 22B 
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ATGGCAatattagctgttaggagagccaagaagagtgatttagagaaatt 

agctagagctaccggaggtagagtcatatcaaatattgatgaattaactt 

cacaagatctaggttatgccgcattagtggaagagagaaaagtaggtgaa 

gacaagatcgtattcgtagaaggtgcaaagaatccaaaatcagttagtat 

actaataagaggaggattagagagagtagtagatgagactgaaagagctc 

ttagggacgctttaggtacagtggcagatgtaataagggatggtagagca 

gtagctggtggtggagctgttgagatagagatagctaagagattaagaaa 

gtatgccccacaagttggtggtaaagagcaattagcaattgaagcatatg 

ctaatgcaatagagggtctcattatgatattggcggaaaacgcaggatta 

gatcctatagacaaattaatgcaattaagaagtcttcacgagaatgagac 

caataaatggtatggacttaatttatttactggaaatccagaggatatgt 

ggaaattaggtgttattgaaccggcactagtgaaaatgaatgcaattaag 

gctgcaacagaagcagtaacattagtgttaagaatagatgatattgtaGG 

TGGTTCTGGTGGTACCatacctgtaataattttaaaagagggatcaagta 

gaacatatggaaaagaagctttaagggctaatattgctgcagtgaaagca 

attgaagaggcattaaaaagcacctatggtccacgtggaatggataagat 

tcttgttgatagcttaggagatattacaataacaaatgatggagccacta 

ttcttgataaaatggatttacaacacccaacaggtaagcttttagttcag 

atagctaaaggacaagacgaggaaacagctgatggcactaaaactgctgt 

aattcttgctggagaattagctaaaaaagcagaagatcttttatataagg 

agattcacccaacaataattgtaagcggatataagaaggcagaagaaatt 

gcattaaagaccatccaagatatagcacaaccggtcagcataaatgatac 

tgacgtacttaggaaagtagcattaacatccttaggcagtaaggcagtag 

caggcgcacgagagtatttagctgaccttgtggttaaagcagtggcacaa 

gtagcagaattaagaggagataagtggtatgttgatctagataatgtaca 

aatagttaaaaaacatggtggtagcattaatgatactcaattagtatacg 

gcatagtagttgataaggaagttgtacatccgggcatgccaaagaggatt 

gaaaatgctaagatagcccttttagacgcttcattagaagttgagaaacc 

cgaattggatgcagaaataagaattaacgatccaacacagatgcacaaat 

tcttggaagaagaagaaaacatattgaaagaaaaagtagataagattgca 

gctactggtgctaacgttgtaatagcgcagaaaggtatcgatgaagttgc 

acaacactatttagctaagaaaggt 


FIGURE 23A 
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MAILAVRRAKKSDLEKLARATGGRVISNIDELTSQDLGYAALVEERKVGE 
DKIVFVEGAKNPKSVS I LI RGGLERWDETERALRDALGTVADVI RDGRA 
VAGGGAVE I E I AKRLRKYAPQVGGKEQLAI EAYANAI EGLIMI LAENAGL 
DPIDKLMQLRSLHENETNKWYGLNLFTGNPEDMWKLGVIEPALVKMNAIK 
AATEAVTLVLRI DDIVGGSGGT I PVI I LKEGS S RT Y GKEALRAN I AAVKA 
IEEALKSTYGPRGMDKILVDSLGDITITNDGATILDKMDLQHPTGKLLVQ 
IAKGQDEETADGTKTAVILAGELAKKAEDLLYKEI HPTI IVSGYKKAEEI 
ALKT I QDIAQPVS I NDT DVLRKVALTSLGSKAVAGARE Y LADLWKAVAQ 
VAELRGDKWYVDLDNVQIVKKHGGSINDTQLVYGIWDKEWHPGMPKRI 
ENAKIALLDASLEVEKPELDAEIRINDPTQMHKFLEEEENILKEKVDKIA 
ATGANWI AQKGI DEVAQHY LAKKG 


FIGURE 23B 
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ATGGCAaataaatggtatggacttaatttatttactggaaatccagagga 

tatgtggaaattaggtgttattgaaccggcactagtgaaaatgaatgcaa 

ttaaggctgcaacagaagcagtaacattagtgttaagaatagatgatatt 

gtaGGTGGTTCTGGTGGTACCatacctgtaataattttaaaagagggatc 

aagtagaacatatggaaaagaagctttaagggctaatattgctgcagtga 

aagcaattgaagaggcattaaaaagcacctatggtccacgtggaatggat 

aagattcttgttgatagcttaggagatattacaataacaaatgatggagc 

cactattcttgataaaatggatttacaacacccaacaggtaagcttttag 

ttcagatagctaaaggacaagacgaggaaacagctgatggcactaaaact 

gctgtaattcttgctggagaattagctaaaaaagcagaagatcttttata 

taaggagattcacccaacaataattgtaagcggatataagaaggcagaag 

aaattgcattaaagaccatccaagatatagcacaaccggtcagcataaat 

gatactgacgtacttaggaaagtagcattaacatccttaggcagtaaggc 

agtagcaggcgcacgagagtatttagctgaccttgtggttaaagcagtgg 

cacaagtagcagaattaagaggagataagtggtatgttgatctagataat 

gtacaaatagttaaaaaacatggtggtagcattaatgatactcaattagt 

atacggcatagtagttgataaggaagttgtacatccgggcatgccaaaga 

ggattgaaaatgctaagatagcccttttagacgcttcattagaagttgag 

aaacccgaattggatgcagaaataagaattaacgatccaacacagatgca 

caaattcttggaagaagaagaaaacatattgaaagaaaaagtagataaga 

ttgcagctactggtgctaacgttgtaatagcgcagaaaggtatcgatgaa 

gttgcacaacactatttagctaagaaaggtatattagctgttaggagagc 

caagaagagtgatttagagaaattagctagagctaccggaggtagagtca 

tatcaaatattgatgaattaacttcacaagatctaggttatgccgcatta 

gtggaagagagaaaagtaggtgaagacaagatcgtattcgtagaaggtgc 

aaagaatccaaaatcagttagtatactaataagaggaggattagagagag 

tagtagatgagactgaaagagctcttagggacgctttaggtacagtggca 

gatgtaataagggatggtagagcagtagctggtggtggagctgttgagat 

agagatagctaagagattaagaaagtatgccccacaagttggtggtaaag 

agcaattagcaattgaagcatatgctaatgcaatagagggtctcattatg 

atattggcggaaaacgcaggattagatcctatagacaaattaatgcaatt 

aagaagtcttcacgagaatgagacc 


FIGURE 24A 
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MANKWYGLNLFTGNPEDMWKLGVIEPALVKMNAIKAATEAVTLVLRIDDI 
VGGSGGT I PVI I LKEGSSRT YGKEALRANIAAVKAIEEALKSTYGPRGMD 
KI LVDSLGD I T ITNDGAT I LDKMDLQH PTGKLLVQI AKGQDEETADGTKT 
AVI LAGE LAKPCAE DLL YKE I H PT I I VS G YKKAEE I ALKT I QD I AQ PVS I N 
DTDVLRKVALTSLGSKAVAGAREYLADLWKAVAQVAELRGDKWYVDLDN 
VQIVKKHGGSINDTQLVYGIWDKEWHPGMPKRIENAKIALLDASLEVE 
KPELDAE IRINDPTQMHKFLEEEENI LKEKVDKIAATGANWI AQKGI DE 
VAQHYLAKKGILAVRRAKKSDLEKLARATGGRVISNIDELTSQDLGYAAL 
VEERBCVGEDKIVFVEGAKNPKSVSILIRGGLERWDETERALRDALGTVA 
DVI RDGRAVAGGGAVE I E I AKRLRKYAPQVGGKEQLAI EAYANAI EGL IM 
ILAENAGLDPIDKLMQLRSLHENET 


FIGURE 24B 
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ATGGCAttaggtgttattgaaccggcactagtgaaaatgaatgcaattaa 

ggctgcaacagaagcagtaacattagtgttaagaatagatgatattgtaG 

GTGGTTCTGGTGGTACCatacctgtaataattttaaaagagggatcaagt 

agaacatatggaaaagaagctttaagggctaatattgctgcagtgaaagc 

aattgaagaggcattaaaaagcacctatggtccacgtggaatggataaga 

ttcttgttgatagcttaggagatattacaataacaaatgatggagccact 

attcttgataaaatggatttacaacacccaacaggtaagcttttagttca 

gatagctaaaggacaagacgaggaaacagctgatggcactaaaactgctg 

taattcttgctggagaattagctaaaaaagcagaagatcttttatataag 

gagattcacccaacaataattgtaagcggatataagaaggcagaagaaat 

tgcattaaagaccatccaagatatagcacaaccggtcagcataaatgata 

ctgacgtacttaggaaagtagcattaacatccttaggcagtaaggcagta 

gcaggcgcacgagagtatttagctgaccttgtggttaaagcagtggcaca 

agtagcagaattaagaggagataagtggtatgttgatctagataatgtac 

aaatagttaaaaaacatggtggtagcattaatgatactcaattagtatac 

ggcatagtagttgataaggaagttgtacatccgggcatgccaaagaggat 

tgaaaatgctaagatagcccttttagacgcttcattagaagttgagaaac 

ccgaattggatgcagaaataagaattaacgatccaacacagatgcacaaa 

ttcttggaagaagaagaaaacatattgaaagaaaaagtagataagattgc 

agctactggtgctaacgttgtaatagcgcagaaaggtatcgatgaagttg 

cacaacactatttagctaagaaaggtatattagctgttaggagagccaag 

aagagtgatttagagaaattagctagagctaccggaggtagagtcatatc 

aaatattgatgaattaacttcacaagatctaggttatgccgcattagtgg 

aagagagaaaagtaggtgaagacaagatcgtattcgtagaaggtgcaaag 

aatccaaaatcagttagtatactaataagaggaggattagagagagtagt 

agatgagactgaaagagctcttagggacgctttaggtacagtggcagatg 

taataagggatggtagagcagtagctggtggtggagctgttgagatagag 

atagctaagagattaagaaagtatgccccacaagttggtggtaaagagca 

attagcaattgaagcatatgctaatgcaatagagggtctcattatgatat 

tggcggaaaacgcaggattagatcctatagacaaattaatgcaattaaga 

agtcttcacgagaatgagaccaataaatggtatggacttaatttatttac 

tggaaatccagaggatatgtggaaa 


FIGURE 25A 
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MALGVI E PALVKMNAI KAAT EAVT LVLR I DD I VGGSGGT I PVI I LKEGSS 
RTYGKEALRANIAAVKAIEEALKSTYGPRGMDKILVDSLGDITITNDGAT 
ILDKMDLQHPTGKLLVQIAKGQDEETADGTKTAVILAGELAKKAEDLLYK 
E I HPT I IVSGYKKAEE I ALKT I QDIAQPVS INDTDVLRKVALT SLGSKAV 
AGARE Y LAD LWKAVAQVAE LRGDKW YVDLDNVQI VKKHGGS I NDTQLVY 
GIWDKEWHPGMPKRIENAKIALLDASLEVEKPELDAEIRINDPTQMHK 
FLEEEENILKEKVDKIAATGANWIAQKGI DEVAQH YLAKKGI LAVRRAK 
KSDLEKLARATGGRVISNIDELTSQDLGYAALVEERKVGEDKIVFVEGAK 
NPKSVS I LI RGGLERWDE T ERALRDALGTVADVI RDGRAVAGGGAVE I E 
IAKRLRKYAPQVGGKEQLAI EAYANAI EGLIMI LAENAGLDPI DKLMQLR 
SLHENETNKWYGLNLFTGNPEDMWK 


FIGURE 25B 



U.S. Patent 


Sep. 14, 2010 


Sheet 51 of 62 


US 7,795,388 B2 


ATGGCAgatccaacacagatgcacaaattcttggaagaagaagaaaacat 

attgaaagaaaaagtagataagattgcagctactggtgctaacgttgtaa 

tagcgcagaaaggtatcgatgaagttgcacaacactatttagctaagaaa 

ggtatattagctgttaggagagccaagaagagtgatttagagaaattagc 

tagagctaccggaggtagagtcatatcaaatattgatgaattaacttcac 

aagatctaggttatgccgcattagtggaagagagaaaagtaggtgaagac 

aagatcgtattcgtagaaggtgcaaagaatccaaaatcagttagtatact 

aataagaggaggattagagagagtagtagatgagactgaaagagctctta 

gggacgctttaggtacagtggcagatgtaataagggatggtagagcagta 

gctggtggtggagctgttgagatagagatagctaagagattaagaaagta 

tgccccacaagttggtggtaaagagcaattagcaattgaagcatatgcta 

atgcaatagagggtctcattatgatattggcggaaaacgcaggattagat 

cctatagacaaattaatgcaattaagaagtcttcacgagaatgagaccaa 

taaatggtatggacttaatttatttactggaaatccagaggatatgtgga 

aattaggtgttattgaaccggcactagtgaaaatgaatgcaattaaggct 

gcaacagaagcagtaacattagtgttaagaatagatgatattgtaGGTGG 

TTCTGGTGGTACCatacctgtaataattttaaaagagggatcaagtagaa 

catatggaaaagaagctttaagggctaatattgctgcagtgaaagcaatt 

gaagaggcattaaaaagcacctatggtccacgtggaatggataagattct 

tgttgatagcttaggagatattacaataacaaatgatggagccactattc 

ttgataaaatggatttacaacacccaacaggtaagcttttagttcagata 

gctaaaggacaagacgaggaaacagctgatggcactaaaactgctgtaat 

tcttgctggagaattagctaaaaaagcagaagatcttttatataaggaga 

ttcacccaacaataattgtaagcggatataagaaggcagaagaaattgca 

ttaaagaccatccaagatatagcacaaccggtcagcataaatgatactga 

cgtacttaggaaagtagcattaacatccttaggcagtaaggcagtagcag 

gcgcacgagagtatttagctgaccttgtggttaaagcagtggcacaagta 

gcagaattaagaggagataagtggtatgttgatctagataatgtacaaat 

agttaaaaaacatggtggtagcattaatgatactcaattagtatacggca 


FIGURE 26A-1 
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tagtagttgataaggaagttgtacatccgggcatgccaaagaggattgaa 

aatgctaagatagcccttttagacgcttcattagaagttgagaaacccga 

attggatgcagaaataagaattaacggcagcggcggatccggggtgagca 

agggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggac 

ggcgacgtaaacggccacaagttcagcgtgtccggcgagggcgagggcga 

tgccacctacggcaagctgaccctgaagttcatctgcaccaccggcaagc 

tgcccgtgccctggcccaccctcgtgaccaccttcggctacggcctgcag 

tgcttcgcccgctaccccgaccacatgaagcagcacgacttcttcaagtc 

cgccatgcccgaaggctacgtccaggagcgcaccatcttcttcaaggacg 

acggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctg 

gtgaaccgcatcgagctgaagggcatcgacttcaaggaggacggcaacat 

cctggggcacaagctggagtacaacGGCGGTACCGGATCTGGAGGTGAGC 

TCaacagccacaacgtctatatcatggccgacaagcagaagaacggcatc 

aaggtgaacttcaagatccgccacaacatcgaggacggcagcgtgcagct 

cgccgaccactaccagcagaacacccccatcggcgacggccccgtgctgc 

tgcccgacaaccactacctgagctaccagtccgccctgagcaaagacccc 

aacgagaagcgcgatcacatggtcctgctggagttcgtgaccgccgccgg 

gatcactctcggcatggacgagctgtacaagcaccaccatcaccatcac 


FIGURE 26A-2 
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MADPTQMHKFLEEEENILKEKVDKIAATGANWIAQKGIDEVAQHYLAKK 
GILAVRRAKKSDLEKLARATGGRVI SNI DELTSQDLGYAALVEERKVGED 
KIVFVEGAKNPKSVSILIRGGLERWDETERALRDALGTVADVIRDGRAV 
AGGGAVEI E I AKRLRKYAPQVGGKEQLAI EAYANAI EGLIMI LAENAGLD 
PI DKLMQLRS LHENETNKWYGLNL FT GN PE DMWKLGVI E PALVKMNAI KA 
ATEAVTLVLRI DDIVGGSGGT I PVI I LKEGSSRT YGKEALRANIAAVKAI 
EEALKSTYGPRGMDKILVDSLGDITITNDGATILDKMDLQHPTGKLLVQI 
AKGQDEETADGTKTAVI LAGE LAKKAE DLL YKE I H PT 1 1 VSGYKKAEE I A 
LKT I QDIAQ PVS INDT DVLRKVALT S LGSKAVAGARE YLADLWKAVAQV 
AELRGDKWYVDLDNVQI VKKHGGS INDTQLVYGI WDKEWH PGMPKRIE 
NAKIALLDASLEVEKPELDAEIRINGSGGSGVSKGEELFTGWPILVELD 
GDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTFGYGLQ 
CFARYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTL 
VNRIELKGIDFKEDGNILGHKLEYNGGTGSGGELNSHNVYIMADKQKNGI 
KVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSYQSALSKDP 
NEKRDHMVLLEFVTAAGITLGMDELYKHHHHHH 


FIGURE 26B 
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ATGGCAaataaatggtatggacttaatttatttactggaaatccagagga 

tatgtggaaattaggtgttattgaaccggcactagtgaaaatgaatgcaa 

ttaaggctgcaacagaagcagtaacattagtgttaagaatagatgatatt 

gtaGGTGGTTCTGGTGGTACCatacctgtaataattttaaaagagggatc 

aagtagaacatatggaaaagaagctttaagggctaatattgctgcagtga 

aagcaattgaagaggcattaaaaagcacctatggtccacgtggaatggat 

aagattcttgttgatagcttaggagatattacaataacaaatgatggagc 

cactattcttgataaaatggatttacaacacccaacaggtaagcttttag 

ttcagatagctaaaggacaagacgaggaaacagctgatggcactaaaact 

gctgtaattcttgctggagaattagctaaaaaagcagaagatcttttata 

taaggagattcacccaacaataattgtaagcggatataagaaggcagaag 

aaattgcattaaagaccatccaagatatagcacaaccggtcagcataaat 

gatactgacgtacttaggaaagtagcattaacatccttaggcagtaaggc 

agtagcaggcgcacgagagtatttagctgaccttgtggttaaagcagtgg 

cacaagtagcagaattaagaggagataagtggtatgttgatctagataat 

gtacaaatagttaaaaaacatggtggtagcattaatgatactcaattagt 

atacggcatagtagttgataaggaagttgtacatccgggcatgccaaaga 

ggattgaaaatgctaagatagcccttttagacgcttcattagaagttgag 

aaacccgaattggatgcagaaataagaattaacgatccaacacagatgca 

caaattcttggaagaagaagaaaacatattgaaagaaaaagtagataaga 

ttgcagctactggtgctaacgttgtaatagcgcagaaaggtatcgatgaa 

gttgcacaacactatttagctaagaaaggtatattagctgttaggagagc 

caagaagagtgatttagagaaattagctagagctaccggaggtagagtca 

tatcaaatattgatgaattaacttcacaagatctaggttatgccgcatta 

gtggaagagagaaaagtaggtgaagacaagatcgtattcgtagaaggtgc 

aaagaatccaaaatcagttagtatactaataagaggaggattagagagag 

tagtagatgagactgaaagagctcttagggacgctttaggtacagtggca 

gatgtaataagggatggtagagcagtagctggtggtggagctgttgagat 

agagatagctaagagattaagaaagtatgccccacaagttggtggtaaag 

agcaattagcaattgaagcatatgctaatgcaatagagggtctcattatg 


FIGURE 27 A- 1 
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atattggcggaaaacgcaggattagatcctatagacaaattaatgcaatt 

aagaagtcttcacgagaatgagaccggctctggcggatccggatccgggg 

tgagcaagggcgaggagctgttcaccggggtggtgcccatcctggtcgag 

ctggacggcgacgtaaacggccacaagttcagcgtgtccggcgagggcga 

gggcgatgccacctacggcaagctgaccctgaagttcatctgcaccaccg 

gcaagctgcccgtgccctggcccaccctcgtgaccaccttcggctacggc 

ctgcagtgcttcgcccgctaccccgaccacatgaagcagcacgacttctt 

caagtccgccatgcccgaaggctacgtccaggagcgcaccatcttcttca 

aggacgacggcaactacaagacccgcgccgaggtgaagttcgagggcgac 

accctggtgaaccgcatcgagctgaagggcatcgacttcaaggaggacgg 

caacatcctggggcacaagctggagtacaacGGCGGTACCGGATCTGGAG 

GTGAGCTCaacagccacaacgtctatatcatggccgacaagcagaagaac 

ggcatcaaggtgaacttcaagatccgccacaacatcgaggacggcagcgt 

gcagctcgccgaccactaccagcagaacacccccatcggcgacggccccg 

tgctgctgcccgacaaccactacctgagctaccagtccgccctgagcaaa 

gaccccaacgagaagcgcgatcacatggtcctgctggagttcgtgaccgc 

cgccgggatcactctcggcatggacgagctgtacaagcaccaccatcacc 

atcac 


FIGURE 27A-2 



U.S. Patent 


Sep. 14, 2010 


Sheet 56 of 62 


US 7,795,388 B2 


MANKW YGLNL FT GN PEDMWKLGVI E PALVKMNAI KAATEAVT LVLR IDDI 
VGGSGGT I PVI I LKE GS SRT YGKEALRAN I AAVKAI EEALKST YGPRGMD 
KILVDSLGDITITNDGATILDKMDLQHPTGKLLVQIAKGQDEETADGTKT 
AVILAGELAKKAEDLLYKEIHPTIIVSGYKKAEEIALKTIQDIAQPVSIN 
DTDVLRKVALT S LGS KAVAGARE Y LADLWKAVAQVAE LRGDKW YVDLDN 
VQIVKKHGGSINDTQLVYGIWDKEWHPGMPKRIENAKIALLDASLEVE 
KPELDAEIRINDPTQMHKFLEEEENILKEKVDKIAATGANWIAQKGIDE 
VAQH YLAKKGI LAVRRAKK S D LE KLARAT GGRV I SNI DELT SQDLGYAAL 
VEERKVGEDKIVFVEGAKNPKSVS I LI RGGLERWDETERALRDALGTVA 
DVIRDGRAVAGGGAVEIEIAKRLRKYAPQVGGKEQLAIEAYANAIEGLIM 
ILAENAGLDPIDKLMQLRSLHENETGSGGSGSGVSKGEELFTGWPILVE 
LDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTFGYG 
LQCFARYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGD 
TLVNRIELKGIDFKEDGNILGHKLEYNGGTGSGGELNSHNVYIMADKQBCN 
GIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSYQSALSK 
DPNEKRDHMVLLE FVTAAGI T LGMDE L YKH H HH HH 


FIGURE 27B 
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ATGGcgagcccggtcttgttattgaaagagggaacgagtagaactactgg 

tagagatgcgttaaggaataatatacttgctgcaaagacactagccgaaa 

tgttaaggagtagtttaggtcctaaaggtcttgataaaatgttaattgat 

agtttcggtgacgtaaccataactaatgatggtgctacaatagtaaagga 

tatggagatacagcatccagcagcaaagctattagtagaagcagctaaag 

cacaagatgctgaagtaggtgatggtactacaagcgctgtagtattagct 

ggtgctctattggagaaagctgaaagtttattggatcaaaatatacatcc 

aacaataattattgaggggtataagaaggcatataccaaggccttggagt 

tacttccacagttaggaactaggatcgatataagggatttgaattcttca 

gttgctagggatactctaagaaaaatagcatttactacactagcaagtaa 

gtttattgcagaaggtgctgaattaaataaaataattgacatggtaatag 

acgcaatagttaatgttgcagaacctctacctaatggtgggtacaatgtg 

agtttagacttaataaagatagataagaagaaaggtggaagtatagagga 

tagcgtcttagttaaaggactagtattagataaggaggttgtgcaccctg 

gaatgcctagaagagtcactaaagccaagatagctgttttggatgcagca 

ttagaggtagaaaagcctgaaatctcagctaagataagtataacatcacc 

agagcaaatcaaggctttcttagatgaggaatccaaatatcttaaggaca 

tggttgataaactagcgtcaataggcgctaacgttgtaatatgccagaaa 

ggtattgatgatatcgcacagcacttcttagctaagaaaggtatattggc 

tgtaagaagggttaagaggagcgatatagagaaattagagaaggcattag 

gcgcgagaataataagcagtattaaagacgctactcccgatgatttagga 

tacgccgaattagttgaggaaaggagagttggaaatgacaaaatggtatt 

tatagaaggtgctaagaatctgaaggccgtgaatatcttgttaagaggtt 

caaatgatatggcattagatgaggctgagaggagtataaatgatgcattg 

catgctctgaggaacatattattagagccagtaatattgccaggtggtgg 

cgctatcgagttagaattagcgatgaaattaagagagtatgctagaagtg 

taggaggtaaggagcaattagctatagaagcatttgcagatgcattagag 

gagatacctatgattttagctgaaactgcagggctggaggctatatctgc 

actaatggacttaagagctagacacgctaagggattaaccaatactggtg 

tagatgtaataggtgggaagatcgtagacgatgtatatgccttaaacatc 

attgagcctataagagtaaaggctcaagtgttaaagagcgcaacagaggc 

ggctacagcaatattaaagattgatgacctaatagctgcagctccattaa 

agagcgagaagaaaggtggagaaggaagtaaagaagaaagtggtggagaa 

ggaggagctggtactccatctttaggagac 


FIGURE 28A 
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MASPVLLLKEGTSRTTGRDALRNNILAAKTLAEMLRSSLGPKGLDKMLID 
S FGDVT I TNDGAT I VKDME IQH PAAKLLVEAAKAQDAEVGDGT T SAWLA 
GALLEKAESLLDQNIHPTIIIEGYKKAYTKALELLPQLGTRIDIRDLNSS 
VARDTLRKIAFTTLASKFI AEGAELNKI I DMVI DAI VNVAE PLPNGGYNV 
SLDLIKIDKKKGGSIEDSVLVKGLVLDKEWHPGMPRRVTKAKIAVLDAA 
LEVEKPE I SAKI S I TS PEQ I KAFLDE ESKYLKDMVDKLAS I GANWICQK 
GI DDIAQH FLAKKGI LAVRRVKRS D I EKLEKALGARI I S S I KDAT PDDLG 
YAELVEERRVGNDKMVFI EGAKNLKAVNI LLRGSNDMALDEAERSINDAL 
HALRNI LLE PVI LPGGGAI E LE LAMKLRE YARS VGGKEQLAIEAFADALE 
E I PMI LAE TAGLEAI SALMDLRARHAKGLTNTGVDVI GGKI VDDVYALNI 
I E PI RVKAQVLKSATEAATAI LKI DDL I AAAPLKS EKKGGE GS KEE SGGE 
GGAGTPSLGD 


FIGURE 28B 
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atggtattcgtagaaggtgcaaagaatccaaaatcagttagtatactaat 

aagaggaggattagagagagtagtagatgagactgaaagagctcttaggg 

acgctttaggtacagtggcagatgtaataagggatggtagagcagtagct 

ggtggtggagctgttgagatagagatagctaagagattaagaaagtatgc 

cccacaagttggtggtaaagagcaattagcaattgaagcatatgctaatg 

caatagaaggacttatcatgatattggcggaaaacgcaggattagatcct 

atagacaaattaatgcaattaagaagtcttcacgagaatgagaccaataa 

atggtatggacttaatttatttactggaaatccagaggatatgtggaaat 

taggtgttattgaaccggcactagtgaaaatgaatgcaattaaggctgca 

acagaagcagtaacattagtgttaagaatagatgatattgtaGGTGGTTC 

TGGTGGTACCatacctgtaataattttaaaagagggatcaagtagaacat 

atggaaaagaagctttaagggctaatattgctgcagtgaaagcaattgaa 

gaggcattaaaaagcacctatggtccacgtggaatggataagatgcttgt 

tgatagcttaggagatattacaataacaaatgatggagccactattcttg 

ataaaatggatttacaacacccaacaggtaagcttttagttcagatagct 

aaaggacaagacgaggaaacagctgatggcactaaaactgctgtaattct 

tgctggagaattagctaaaaaagcagaagatcttttatataaggagattc 

acccaacaataattgtaagcggatataagaaggcagaagaaattgcatta 

aagaccatccaagatatagcacaaccggtcagcataaatgatactgacgt 

acttaggaaagtagcattaacatccttaggcagtaaggcagtagcaggcg 

cacgagagtatttagctgaccttgtggttaaagcagtggcacaagtagca 

gaattaagaggagataagtggtatgttgatctagataatgtacaaatagt 

taaaaaacatggtggtagcattaatgatactcaattagtatacggcatag 

tagttgataaggaagttgtacatccgggcatgccaaagaggattgaaaat 

gctaagatagcccttttagacgcttcattagaagttgagaaacccgaatt 

ggatgcagaaataagaattaac 


FIGURE 29A 
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MVFVEGAKNPKSVSILIRGGLERWDETERALRDALGTVADVIRDGRAVA 
GGGAVEIEIAKRLRKYAPQVGGKEQLAIEAYANAIEGLIMILAENAGLDP 
IDKLMQLRSLHENETNKWYGLNLFTGNPEDMWKLGVIEPALVKMNAIKAA 
TEAVTLVLRI DDI VGGSGGT I PVI I LKEGS SRTYGKEALRANI AAVKAI E 
EALKSTYGPRGMDKMLVDSLGDITITNDGATILDKMDLQHPTGKLLVQIA 
KGQDEETADGTKTAVILAGELAKKAEDLLYKEIHPTIIVSGYKKAEEIAL 
KTIQDIAQPVS INDTDVLRKVALTS LGSKAVAGAREYLADL WKAVAQVA 
ELRGDKWYVDLDNVQIVKKHGGSINDTQLVYGIWDKEWHPGMPKRIEN 
AKIALLDASLEVEKPELDAEIRIN 
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ATGGCAACAGCTACAGTTGCAACTACACCCGAAGGTATACCTGTAATAAT 
TTTAAAAGAGGGATCAAGTAGAACATATGGAAAAGAAGCTTTAAGGGCTA 
ATATTGCTGCAGTGAAAGCAATTGAAGAGGCATTAAAAAGCACCTATGGT 
CCACGTGGAATGGATAAGATGTTCGTTGATAGCTTAGGAGATATTACAAT 
AACAAATGATGGAGCCACTATTCTTGATAAAATGGATTTACAACACCCAA 
CAGGTAAGCTTTTAGTTCAGATAGCTAAAGGACAAGACGAGGAAACAGCT 
GATGGCACTAAAACTGCTGTAATTCTTGCTGGAGAATTAGCTAAAAAAGC 
AGAAGATCTTTTATATAAGGAGATTCACCCAACAATAATTGTAAGCGGAT 
ATAAGAAGGCAGAAGAAATTGCATTAAAGACCATCCAAGATATAGCACAA 
CCGGTCAGCATAAATGATACTGACGTACTTAGGAAAGTAGCATTAACATC 
CTTAGGCAGTAAGGCAGTAGCAGGCGCACGAGAGTATTTAGCTGACCTTG 
TGGTTAAAGCAGTGGCACAAGTAGCAGAATTAAGAGGAGATAAGTGGTAT 
GTTGATCTAGATAATGTACAAATAGTTAAAAAACATGGTGGTAGCATTAA 
TGATACTCAATTAGTATACGGCATAGTAGTTGATAAGGAAGTTGTACATC 
CGGGCATGCCAAAGAGGATTGAAAATGCTAAGATAGCCCTTTTAGACGCT 
TCATTAGAAGTTGAGAAACCCGAATTGGATGCAGAAATAAGAATTAACGA 
TC C AAC AC AG AT GC AC AAAT TC T T GG AAG AAG AAG AAAAC AT AT TGAAAG 
AAAAAGTAGATAAGATTGCAGCTACTGGTGCTAACGTTGTAATATGCCAG 
AAAGGTATCGATGAAGTTGCACAACACTATTTAGCTAAGAAAGGTATATT 
AGCTGTTAGGAGAGCCAAGAAGAGTGATTTAGAGAAATTAGCTAGAGCTA 
CCGGAGGTAGAGTCATATCAAATATTGATGAATTAACTTCACAAGATCTA 
GGTTATGCCGCATTAGTGGAAGAGAGAAAAGTAGGAGAGGATAAGATGGT 
ATTCGTAGAAGGTGCAAAGAATCCAAAATCAGTTAGTATACTAATAAGAG 
GAGGATTAGAGAGAGTAGTAGATGAGACTGAAAGAGCTCTTAGGGACGCT 
TTAGGTACAGTGGCAGATGTAATAAGGGATGGTAGAGCAGTAGCTGGTGG 
TGGAGCTGTTGAGATAGAGATAGCTAAGAGATTAAGAAAGTATGCCCCAC 
AAGTTGGTGGTAAAGAGCAATTAGCAATTGAAGCATATGCTAATGCAATA 
GAAGGACTTATCATGATATTGGCGGAAAACGCAGGATTAGATCCTATAGA 
CAAATTAATGCAATTAAGAAGTCTTCACGAGAATGAGACCAATAAATGGT 
ATGGACTTAATTTATTTACTGGAAATCCAGAGGATATGTGGAAATTAGGT 
GTTATTGAACCGGCACTAGTGAAAATGAATGCAATTAAGGCTGCAACAGA 
AGCAGTAACATTAGTGTTAAGAATAGATGATATTGTAGCAGCTGGAAAGA 
AGGGTGGAAGTGAGCCAGGCGGTAAGAAAGAGAAAGAAGAAAAGTCCTCT 
GAAGAC 
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ATGGCAACAGCTACAGTTGCAACTACACCCGAAGGTATACCTGTAATAAT 
TTTAAAAGAGGGATCAAGTAGAACATATGGAAAAGAAGCTTTAAGGGCTA 
ATATTGCTGCAGTGAAAGCAATTGAAGAGGCATTAAAAAGCACCTATGGT 
CCACGTGGAATGGATAAGATGTTCGTTGATAGCTTAGGAGATATTACAAT 
AACAAATGATGGAGCCACTATTCTTGATAAAATGGATTTACAACACCCAA 
CAGGTAAGCTTTTAGTTCAGATAGCTAAAGGACAAGACGAGGAAACAGCT 
GATGGCACTAAAACTGCTGTAATTCTTGCTGGAGAATTAGCTAAAAAAGC 
AGAAGATCTTTTATATAAGGAGATTCACCCAACAATAATTGTAAGCGGAT 
ATAAGAAGGCAGAAGAAATTGCATTAAAGACCATCCAAGATATAGCACAA 
CCGGTCAGCATAAATGATACTGACGTACTTAGGAAAGTAGCATTAACATC 
CTTAGGCAGTAAGGCAGTAGCAGGCGCACGAGAGTATTTAGCTGACCTTG 
TGGTTAAAGCAGTGGCACAAGTAGCAGAATTAAGAGGAGATAAGTGGTAT 
GTTGATCTAGATAATGTACAAATAGTTAAAAAACATGGTGGTAGCATTAA 
TGATACTCAATTAGTATACGGCATAGTAGTTGATAAGGAAGTTGTACATC 
CGGGCATGCCAAAGAGGATTGAAAATGCTAAGATAGCCCTTTTAGACGCT 
TCATTAGAAGTTGAGAAACCCGAATTGGATGCAGAAATAAGAATTAACGA 
TCCAACACAGATGCACAAATTCTTGGAAGAAGAAGAAAACATATTGAAAG 
AAAAAGTAGATAAGATTGCAGCTACTGGTGCTAACGTTGTAATATGCCAG 
AAAGGTATCGATGAAGTTGCACAACACTATTTAGCTAAGAAAGGTATATT 
AGCTGTTAGGAGAGCC AAGAAG AG T G AT TT AGAGAAAT TAGC TAGAGC TA 
CCGGAGGTAGAGTCATATCAAATATTGATGAATTAACTTCACAAGATCTA 
GGTTATGCCGCATTAGTGGAAGAGAGAAAAGTAGGAGAGGATAAGATGGT 
ATTCGTAGAAGGTGCAAAGAATCCAAAATCAGTTAGTATACTAATAAGAG 
GAGGATTAGAGAGAGTAGTAGATGAGACTGAAAGAGCTCTTAGGGACGCT 
TTAGGTACAGTGGCAGATGTAATAAGGGATGGTAGAGCAGTAGCTGGTGG 
TGGAGCTGTTGAGATAGAGATAGCTAAGAGATTAAGAAAGTATGCCCCAC 
AAGTTGGTGGTAAAGAGCAATTAGCAATTGAAGCATATGCTAATGCAATA 
GAAGGACTTATCATGATATTGGCGGAAAACGCAGGATTAGATCCTATAGA 
CAAATTAATGCAATTAAGAAGTCTTCACGACAATGAGACCAATAAATGGT 
ATGGACTTAATTTATTTACTGGAAATCCAGAGGATATGTGGAAATTAGGT 
GTTATTGAACCGGCACTAGTGAAAATGAATGCAATTAAGGCTGCAACAGA 
AGCAGTAACATTAGTGTTAAGAATAGATGATATTGTAGCAGCTGGAAAGA 
AGGGTGGAAGTGAGCCAGGCGGTAAGAAAGAGAAAGAAGAAAAGTCCTCT 
GAAGAC 


FIG 


31 



US 7,795,388 B2 


1 

VERSATILE PLATFORM FOR 
NANOTECHNOLOGY BASED ON CIRCULAR 
PERMUTATIONS OF CHAPERONIN 
PROTEIN 

This application is a continuation-in-part of U.S. Ser. No. 
10/494,853, which is based on PCT/US02/35889, filed Nov. 
8, 2002, which claims priority of a provisional application, 
U.S. Ser. No. 60/340,538, filed Nov. 8, 2001, the contents of 
all of which are hereby incorporated by reference in their 
entirety into this application. 

Throughout this application various publications are refer- 
enced. The disclosures of these publications in their entireties 
are hereby incorporated by reference into this application in 
order to more fully describe the state of the art to which this 
invention pertains. 

The work described here was supported, at least in part, by 
grants from: The National Aeronautics and Space Adminis- 
tration. The United States government may, therefore, have 
certain rights in the invention. 

FIELD OF INVENTION 

The present invention relates to the field of bio-nanotech- 
nology. The present invention provides compositions and 
methods for making chaperonin subunit polypeptides which 
are modified by relocating the N-terminal and C-terminal to 
different positions. The modified chaperonin subunit 
polypeptides can assemble as part of chaperonin structures, 
and the relocated N- and C-terminal ends can be linked to or 
bind with organic or inorganic molecules. The assembled 
chaperonin structures are useful for producing nanoscale fila- 
ments or two-dimensional crystals having functions pro- 
duced by the linked or bound organic or inorganic molecule. 

BACKGROUND 

The controlled organization of inorganic materials into 
multi-dimensional addressable arrays is the foundation for 
both logic and memory devices, as well as other nonlinear 
optical and sensing devices (Zhirnov et al., 2001 Computer 
34: 34-43; Xiaetal, 2000 Adv. Mater. 12: 693-713). Many of 
these devices are currently fabricated using lithographic pat- 
terning processes that have progressively developed toward 
greater integration densities and smaller sizes. At submicron 
scales, however, conventional lithographic processes are 
approaching their practical and theoretical limits. At scales 
below 100 nm, ion and electron beam lithography becomes 
prohibitively expensive and time consuming, and more 
importantly, at these scales quantum effects fundamentally 
change the properties of devices (Sato et al., 1997 J. Appl. 
Phys. 82: 696). 

Nanoscale templates for constrained synthesis, in situ 
deposition, or direct patterning of nanometer scale inorganic 
arrays are being developed using both artificial and natural 
materials. Artificial materials such as microphase separated 
block copolymers (Park et al., 2001 Appl. Phys. Lett. 79: 
257-259) and hexagonally close-packed spheres (Hulteen et 
al., 1995 J. Vac. Sci. Technol. A, 1553-1558) have been used 
for nano scale fabrication. Natural materials such as DNA 
(Richter et al., 2000 Adv. Mater. 12: 507-510; Keren et al., 
2002 Science 297: 72-75), bacterial and archaeal surface 
layer proteins (S-layer proteins) (Sleytr et al., 1999 Angew. 
Chem. Int. Ed. 38: 1034-1054; Douglas et al., Appl. Phys. 
Lett. 48: 676-678; Hall et al., 2001 CHEMPHYSCHEM 3: 
1 84- 186), virus capsids (Shenton et al., 1 999 Adv. Mater. 1 1 : 
253-256; Douglas et al., 1999 Adv. Mater., 679-681; Douglas 
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et al., Nature 393: 152-155; Wang et al., 2002 Angew. Chem. 
Int. Ed. 41: 459-462), phage (Lee et al., 2002 Science 296: 
892-895), and some globular proteins (Yamashita, I., 2001 
Thin Solid Films 393 : 1 2-1 8) have been used as templates and 
5 in other nanoscale applications. 

Various nanometer scale objects, including arrays of nano- 
particles formed by non-conventional methods are being 
explored for use as viable alternatives to standard lithographi- 
cally patterned devices. Individual nanoparticles, also known 
to as quantum dots (QDs), have been shown to behave as iso- 
lated device components such as single electron transistors 
(Likharev, K. K., 1999 Proc. IEEE 87: 606-632; Thelander et 
al., 2001 Appl. Phys. Lett. 79: 2106-2108). Theoreticians 
have postulated that two-dimensional arrays of QDs with 
15 nanoscale resolution could form the basis of future genera- 
tions of electronic and photonic devices. The function of these 
devices will be based on phenomena such as coulomb charg- 
ing, inter-dot quantum tunneling and other coherent proper- 
ties derived from the electronic consequences of confinement 
20 and nanoparticle surface area to volume ratio s (Maier, S . A. et 
al., 2001 Adv. Mater. 13: 1501-1505; Maier et al., Phys. Rev. 
B 65, 193408; Zrenner, A. et al., 2002 Nature 418: 612-614; 
Berven et al., 2001 Adv. Mater. 13: 109-113). 

Traditional techniques for patterning ordered arrays of 
25 materials onto inorganic substrates and manufacturing 
devices currently used are ion beam lithography and molecu- 
lar beam epitaxy. These techniques possess inherent limita- 
tions due to the use of polymeric light masks for pattern 
formation, however, there is a theoretical limitation of pat- 
30 teming that could ultimately limit the processes in the hun- 
dreds of nanometers. 

While there are strong incentives to develop nanoscale 
architectures, these developments require alternate fabrica- 
tion methods and new insights into the behavior of materials 
35 onnanometer scales (Nalwa, H. S, 2000 “Handbook of Mate- 
rials and Nanotechnology”, Academic Press, San Diego). 

Development of methods for ordering nanoscale materials 
through “bottom up” assembly will provide new tools for 
creating nano structured materials and devices that self-as- 
40 semble or self-repair. Synthetic and biological polymers have 
gained attention because of their inherent ability to form 
structures on the nanometer scale with little or no mechanical 
processing. Self-assembly and phase separation of these 
natural or synthetic polymers have been successfully used for 
45 nanoscale ordering of materials. Biopolymers form espe- 
cially well-defined structures and assemblies with highly spe- 
cific chemical functionalities. Nucleic acids (J Richter, et al., 
2000 Advanced Materials 12:507-510; M G Warner and J E 
Hutchison 2003 Nature Materials 2:272-277; and K Keren, et 
50 al., 202 Science 297:72-75), proteins (K Douglas and N A 
Clark 1986 Appl Phys Lett 48:676-678; U B Sleytr, et al., 
1 999 Angew Chem Int Edn 38: 1 034-1054; I Yamashita 2001 
Thin Solid Films 393:12-18; M Allen, et al, 2002 14:1562- 
1565; R A McMillan, et al, 2002 Nature Materials 1:247- 
55 252), virions and virus capsids (W Shenton, et al, 1 999 Adv 
Mater 11:253-256; S-W Lee, et al, 2002 Science 296:892- 
895; Q Wang, et al, 2002 Angew Chem Int Ed Engl 41 :459- 
462) have all been used to create nano structured materials 
with unique properties. 

60 A number of protein complexes have been developed as 
nanoscale templates. These templates can be functionalized 
by genetic modification to add chemically reactive sites that 
bind inorganic materials. For example, chaperonin com- 
plexes can be functionalized to bind soft metals. In nature, 
65 chaperonins are protein complexes having two stacked rings 
each comprising 7, 8 or 9 HSP60 subunits. The HSP60 sub- 
units were mutated to include single cysteine residues placed 
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at different solvent-exposed sites, including the apical pore. 
The thiols of these cysteine residues provide binding sites for 
gold or zinc (PCT/US02/35889). The chaperonin complexes 
comprising these mutant HSP60 subunits bind gold or zinc 
and assemble into two-dimensional crystals. 5 

Protein complexes can also be modified to include peptide 
sequences having desirable binding or catalytic functions. 
These protein complexes comprise subunits having inserted 
peptide sequences. However, the mutant subunits may fail to 
fold, assemble into complexes or organize into higher-order to 
structures. Furthermore, insertion as a loop may render the 
peptide sequence inactive and fusion to one of the native 
termini may not provide sufficient surface accessibility. To 
overcome this challenge, circular permutation has been used 
to join peptide sequences within a protein template. Circular 15 
permutation is a reordering of the polypeptide chain such that 
the original N- and C-terminal ends are joined and new ter- 
mini are created elsewhere in the protein. New peptide 
sequences can be joined to either of the new termini without 
perturbing subunit assembly. Published studies of protein 20 
circular permutation demonstrate that, for proteins in which 
the native amino and carboxyl termini are near in space, many 
relocated positions for the new termini are viable (P T Beer- 
nink, et al., 2001 Protein Sci 10:528-537; U Heinemann and 
M Hahn 1995 Prog Biophys Mol Biol 64:121-143; M 25 
Iwakura, et al., 2000 Nat Struct Biol 7:580-585). 

The present invention provides chaperonin subunit 
polypeptides which are modified to relocate the native N-ter- 
minal and C-terminal ends from the central pore region to 
various new positions on the exterior of the folded modified 30 
chaperonin polypeptide. The relocated N- and C-terminal 
ends are joined with a peptide sequence that behaves as a 
reporter. The modified chaperonin polypeptides fold into sub- 
units that self-assemble into double-ringed chaperonin struc- 
tures, and the chaperonin structures organize into higher 35 
order structures such as two-dimensional crystals and fila- 
ments. Additionally, the reporter peptide is functional. These 
chaperonin structures are useful for producing ordered nanos- 
cale materials and devices. 

40 

SUMMARY 

The present invention provides chaperonin polypeptides 
which are modified to include N-terminal and C-terminal 
ends that are relocated from the central pore region to various 45 
different positions in the polypeptide which are located on the 
exterior of the folded modified chaperonin polypeptide. 

The relocated N-terminal or C-terminal ends can be 
covalently joined to, or bound with, a nucleic acid molecule, 
a lipid, a carbohydrate, a second polypeptide, or a nanopar- 50 
tide. 

In the modified chaperonin polypeptide, the naturally-oc- 
curring N-terminal and C-terminal ends are joined together 
directly or with an intervening linker peptide sequence. In one 
embodiment, the intervening linker sequence comprises the 55 
amino acid sequence Gly-Gly-Ser-Gly-Gly-Thr. 

The modified chaperonin polypeptide is based on a Group 
I or Group II chaperonin polypeptide. 

The Group I chaperonin polypeptides are from Escheri- 
chia, Cyanobacteria, Mycobacteria, Coxiella, Rickettsia, 60 
Chlamydia, Thermotoga , chloroplast, mammalian mitochon- 
dria, or yeast mitochondria. The Group II chaperonin 
polypeptides are from Sulfolobales, Methanopyrus , Pyrodic- 
tium, Thermoplasma, Thermoplasma, Thermococus, Pyroco- 
cus, Mathanococus , or yeast cytosol. 65 

In one embodiment, the modified chaperonin polypeptide 
from Sulfolobus is an alpha, beta, or gamma polypeptide. 
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In one embodiment, the modified chaperonin polypeptide 
comprises a Sulfolobus shibatae TF55 beta polypeptide com- 
prising the N-terminal and C-terminal ends relocated after 
any amino acid position in the range 149-158, in particular 
relocated to position 153. 

In another embodiment, the modified chaperonin polypep- 
tide comprises a Sulfolobus shibatae TF55 beta polypeptide 
comprising the N-terminal and C-terminal ends relocated 
after any amino acid position in the range 263-270, in par- 
ticular relocated to position 267. 

In another embodiment, the modified chaperonin polypep- 
tide comprises a Sulfolobus shibatae TF55 beta polypeptide 
comprising the N-terminal and C-terminal ends relocated 
after any amino acid position in the range 311-330, in par- 
ticular relocated to position 316. 

In another embodiment, the modified chaperonin polypep- 
tide comprises a Sulfolobus shibatae TF55 beta polypeptide 
comprising the N-terminal and C-terminal ends relocated 
after any amino acid position in the range 472-487, in par- 
ticular relocated to position 480. 

In yet another embodiment, the modified chaperonin 
polypeptide comprises a Sulfolobus shibatae TF55 beta 
polypeptide comprising the N-terminal and C-terminal ends 
relocated after any amino acid position in the range 494-508, 
in particular relocated to position 499. 

The present invention also provides assembled chaperonin 
structures, comprising at least one of the modified chaperonin 
subunits of the invention. 

The assembled chaperonin structures can have one or more 
of 2-, 3-, 4-, 5-, 6-, 7-, 8-, 9-, and 10-fold symmetry. 

The present invention further provides: nucleic acid mol- 
ecules encoding the modified chaperonin polypeptide of the 
invention; vectors comprising these nucleic acid molecules; 
and host cells carrying these vectors. 

The present invention provides in vivo methods for pro- 
ducing modified chaperonin polypeptides, comprising cultur- 
ing the host/vector systems under conditions suitable for the 
host to produce the modified chaperonin polypeptide. 

The present invention also provides in vitro methods for 
producing the assembled chaperonin structures of the inven- 
tion, comprising reacting the modified chaperonin polypep- 
tides under conditions suitable for self-assembly of the modi- 
fied chaperonin polypeptide into the assembled chaperonin 
structures. 

The present invention provides chaperonin structures that 
organize into higher order structures such as nanofilaments or 
nanoarrays. These nanofilaments and nanoarrays can be used 
to produce nanodevices or nanocoatings. 

BRIEF DESCRIPTION OF FIGURES 

FIG. 1 illustrates an end and side view of a model of an 
HSP60 chaperonin at 2.3 A resolution. The outlined region of 
the side view shows a single subunit of HSP60. 

FIGS. 2A-2R show the protein sequence alignment of S. 
shibatae TF55 beta subunit (SEQ ID NO: 1), bacterial £. coli 
GroEL (SEQ ID NO: 2), thermosome T. acidophilum beta 
subunit (SEQ ID NO:3), cyanobacterial synechococcus 
HSP60 (SEQ ID NO:4), M. acetivorans HSP60-4 (SEQ ID 
NO:5),M. tuberculosis HSP65 (SEQ ID NO:6), thermosome 
A. pernix alpha subunit (SEQ ID NO:7), thermosome M. 
mazei alpha subunit (SEQ ID NO:8), mitochondrial A. 
thaliana HSP60 (SEQ ID NO: 9), yeast TCP1 alpha subunit 
(SEQ ID NO: 10), human mitochondrial HSP60 (SEQ ID 
NO: 11), mouse mitochondrial HSP60 (SEQ ID NO: 12), 
human TCP1 alpha subunit (SEQ ID NO: 13), mouse TCP1 
alpha subunit (SEQ ID NO: 14), and the consensus (SEQ ID 
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NO: 15). Identical residues are enclosed in a dot-dashed box, 
blocks of similar residues are enclosed in a solid box, and 
conservative matches are enclosed in a dashed box. 

FIG. 3 shows a structural alignment of the archaeal chap- 
eronin (thermo some) and the bacterial chaperonin (GroEL), 
indicating the structural similarities between group I and 
group II chaperonins. The black areas of the structural align- 
ment indicate where the features of the two chaperonin sub- 
units overlap. 

FIG. 4 shows the detailed structure of a Group II chapero- 
nin subunit. 

FIGS. 5A-D shows individual HSP60 (heat-shock protein) 
chaperonins and filaments as observed in the electron micro- 
scope. 

FIGS. 6A and 6B show the organization of HSP60 rings 
into 2 -dimensional crystals on a metal grid coated with lipid 
(6A) and filament bundles arranged on a bed of rings (visible 
as spots in background) (6B). 

FIGS. 7A-E show the assembly of engineered HSP60s 
(heat-shock proteins) into nanotemplates for the production 
of nanoarrays comprising nanoscale materials such as nano- 
particles. 

FIGS. 8A-D show gold nanoparticles binding to engi- 
neered chaperonins and chaperonin nano templates. 

FIGS. 9A-D show semiconductor QD nanoarrays. 

FIGS. 10A-D show the formation of a nanoarray of gold 
nanoparticles. FIG. 10(D) shows XEDS spectra of bare car- 
bon film (solid line) and the gold nanoparticle nanoarray 
(dashed line) from the probed area outlined by a circle in FIG. 
10(B), as indicated by the arrow. 

FIGS. 11A-C show HAADF STEM imaging of a nanogold 
array. 

FIG. 12 shows a control experiment showing DIC (left) and 
fluorescent (right) images of non-cys-mutated chaperonin 
crystals after incubation with CdSe — ZnS QDs. 

FIG. 13 shows an Energy Filtered TEM thickness map of a 
typical 2D protein crystal. 

FIG. 14 illustrates steps in the formation of an ordered 
nanoarray of nanoparticles on a substrate. 

FIG. 15 shows the protein sequence alignment of S. shiba- 
tae TF55 alpha subunit (SEQ ID NO: 39), beta subunit (SEQ 
ID NO: 1) and gamma subunit (SEQ ID NO: 38). 

FIGS. 16A and 16B show the DNA sequence (SEQ ID NO: 
37) and amino-acid sequence for S. shibatae gamma subunit 
(SEQ ID NO: 38). 

FIGS. 17A-E show models of circular permutant proteins 
at position: A) 153, B) 267, C) 316, D) 480, and E) 499. The 
left column shows ribbon representations of protein subunits. 
The center column shows side views of surface representa- 
tions of the assembled rings. The right column shows top 
views of the surface representations of the assembled rings. 
Amino termini are labeled in blue. Carboxyl termini are 
labeled in red. The flexible linker sequences are labeled in 
green. In the assembled rings, a single subunit is highlighted 
in yellow. 

FIG. 18 shows a native polyacrylamide gel electrophoresis 
of wild-type chaperonin and circular permutants as unas- 
sembled protein subunits and assembled double rings in the 
presence and absence of ATP and magnesium ions. 

FIGS. 19A-J shows transmission electron microscopy 
(TEM) of assembled chaperonin double rings comprising 
circular permutant chaperonin proteins (lacking fused 
EYFP). Samples containing isolated double rings, shown in 
panels A-E were prepared 1 hour after addition of ATP and 
MgCl 2 to the protein samples. Samples containing higher- 
order structures, shown in panels F-J were prepared 24 hours 
after addition of ATP and Mg 2+ to the protein samples. A) 
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Circular permutant at position 153 at 1 hour. F) Circular 
permutant at position 153 at 24 hours. B) Circular permutant 
at position 267 at 1 hour. G) Circular permutant at position 
267 at 24 hours. C) Circular permutant at position 316 at 1 
5 hour. H) Circular permutant at position 316 at 24 hours. D) 
Circular permutant at position 480 at 1 hour. J) Circular 
permutant at position 480 at 24 hours. E) Circular permutant 
at position 499 at 1 hour. J) Circular permutant at position 499 
at 24 hours. 

l o FIG. 2 0A-D shows circular permutant chaperonin proteins 

at positions 267 and 480 joined with yellow fluorescent pro- 
tein (EYFP). A) Fluorescence excitation (open symbols) and 
emission (closed symbols) spectra for unmodified EYFP 
(circles), 267 permutant-EYFP fusion protein (squares), 480 
1 5 permutant-EYFP fusion protein (triangles) . B) TEM showing 
negatively stained rings formed by 480 permutant-EYFP 
fusion protein. C) Fluorescence microscopy of a crystal 
formed by the 267 permutant-EYFP fusion protein using 
Leica filter cube 1 3 (excitation 470 nm, emission 515 nm) . D) 
20 TEM showing 2-dimensional crystal formed by 267 permu- 
tant-EYFP fusion protein. 

FIG. 21 shows the full (A) DNA (SEQ ID NO: 66) and (B) 
amino acid (SEQ ID NO: 67) sequence of a position 153 
circular permutant chaperonin protein. 

25 FIG. 22 shows the full (A) DNA (SEQ ID NO: 68) and (B) 
amino acid (SEQ ID NO: 69) sequence of a position 267 
circular permutant chaperonin protein. 

FIG. 23 shows the full (A) DNA (SEQ ID NO:70) and (B) 
amino acid (SEQ ID NO:71) sequence of a position 316 
30 circular permutant chaperonin protein. 

FIG. 24 shows the full (A) DNA (SEQ ID NO:72) and (B) 
amino acid (SEQ ID NO:73) sequence of a position 480 
circular permutant chaperonin protein. 

FIG. 25 shows the full (A) DNA (SEQ ID NO:74) and (B) 
35 amino acid (SEQ ID NO:75) sequence of a position 499 
circular permutant chaperonin protein. 

FIG. 26 shows the full (A) DNA (SEQ ID NO: 76) and (B) 
amino acid (SEQ ID NO:77) sequence of a position 267 
circular permutant chaperonin-EYFP fusion protein. 

40 FIG. 27 shows the full (A) DNA (SEQ ID NO:78) and (B) 
amino acid (SEQ ID NO:79) sequence of a position 480 
circular permutant chaperonin-EYFP fusion protein. 

FIG. 28 shows the full (A) DNA (SEQ ID NO: 80) and (B) 
amino acid (SEQ ID NO: 1) sequence ofwild-typeTF5 5 alpha 
45 subunit from Sulfolobus shibatae. 

FIGS. 29A and 29B show the full (A) DNA (SEQ ID 
NO:81) and (B) amino acid (SEQ ID NO: 82) sequence of a 
position 267 “dwarf’ deletion circular permutant chaperonin 
protein. 

50 FIG. 30 shows the DNA sequence (SEQ ID NO: 85) of 
TF55 beta subunit from Sulfolobus shibatae. 

FIG. 31 shows a different DNA sequence (SEQ ID NO: 86) 
of TF55 beta subunit from Sulfolobus shibatae. 

55 DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

All scientific and technical terms used in this application 
60 have meanings commonly used in the art unless otherwise 
specified. As used in this application, the following words or 
phrases have the meanings specified. 

As used herein, the term “isolated” refers to a specific 
nucleic acid or polypeptide, or a fragment thereof, in which 
65 contaminants (i.e. substances that differ from the specific 
nucleic acid or polypeptide molecule) have been separated 
from the specific nucleic acid or polypeptide. 
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As used herein, the term “purified” refers to a specific 
isolated nucleic acid or polypeptide, or a fragment thereof, in 
which substantially all contaminants (i.e. substances that dif- 
fer from the specific nucleic acid or polypeptide molecule) 
have been separated from the specific nucleic acid or 
polypeptide. 

As used herein, the term “naturally -occurring” refers to a 
nucleic acid or polypeptide molecule found in nature. 

As used herein, the term “wild type” refers to a nucleic acid 
or polypeptide molecule having the same nucleotide and/or 
amino acid sequence as a naturally-occurring, non-mutant 
molecule, respectively. 

As used herein, the term “modified” refers to molecules 
with amino acid or nucleotide sequences differing (mutated) 
from a naturally-occurring i.e., wild-type, amino acid or 
nucleotide, sequence. The modified molecules can retain the 
same structural properties as a wild-type molecule. 

As used herein, the term “derivative” means any modifica- 
tion or alteration of a wild-type molecule. Derivatives 
include, but are not limited to: a substitution, conservative or 
non-conservative, in a amino acid and/or nucleotide sequence 
including substitutions by other amino acids, nucleotides, 
amino acid analogs or nucleotide analogs; a deletion of one or 
more amino acids and/or nucleotides; an insertion of one or 
more amino acids and/or nucleotides; and pre- and/or post- 
translational modifications. A derivative molecule can share 
sequence similarity and/or activity with its parent molecule. 

As used herein, a first nucleotide or amino acid sequence is 
said to have sequence “identity” to a second nucleotide or 
amino acid sequence, respectively, when a comparison of the 
first and the second sequences shows that they are exactly 
alike. 

As used herein, a first nucleotide or amino acid sequence is 
said to be “similar” to a second sequence when a comparison 
of the two sequences shows that they have few sequence 
differences (i.e., the first and second sequences are nearly 
identical). For example, two sequences are considered to be 
similar to each other when the percentage of nucleotides or 
amino acids that differ between the two sequences can be 
between about 60% to 99.99%. 

As used herein, the term “complementary” refers to nucleic 
acid molecules having purine and pyrimidine nucleotide 
bases which have the capacity to associate through hydrogen 
bonding to form base pairs thereby mediating formation of 
double stranded nucleic acid molecules. The following base 
pairs are related by complementarity: guanine and cytosine; 
adenine and thymine; and adenine and uracil. Complemen- 
tary applies to all base pairs comprising two single- stranded 
nucleic acid molecules, or to all base pairs comprising a 
single- stranded nucleic acid molecule folded upon itself. 

As used herein, the term “conservative” refers to substitut- 
ing an amino acid residue for a different amino acid residue 
that has similar chemical properties. A conservative amino 
acid substitution includes: substituting any hydrophobic 
(e.g., nonpolar) amino acid for any other hydrophobic amino 
acid; or substituting any hydrophilic (polar, uncharged) 
amino acid for any other hydrophilic amino acid; or substi- 
tuting any positively charged amino acid for any other posi- 
tively charge amino acid; or substituting any negatively 
charge amino acid for any other negatively charged amino 
acid (TE Creighton, “Proteins” WH Freeman and Company, 
New York). The amino acid substitutions include, but are not 
limited to, substituting any of isoleucine (I), valine (V), and 
leucine (L) for any other of these hydrophobic amino acids; 
aspartic acid (D) for glutamic acid (E) and vice versa; 
glutamine (Q) for asparagine (N) and vice versa; and serine 
(S) for threonine (T) and vice versa. Other substitutions can 
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also be considered conservative, depending on the environ- 
ment of the particular amino acid and its role in the three- 
dimensional structure of the protein. For example, glycine 
(G) and alanine (A), or glycine (G) and serine (S) can fre- 
5 quently be interchangeable, as can alanine (A) and valine (V) . 
Methionine (M), which is relatively hydrophobic, can fre- 
quently be interchanged with leucine and isoleucine, and 
sometimes with valine. Lysine (K) and arginine (R) are fre- 
quently interchangeable in locations in which the significant 
l o feature of the amino acid residue is its charge and the differing 
pK’s of these two amino acid residues are not significant. Still 
other changes can be considered conservative in particular 
environments. 

As used herein, the term “nonconservative” refers to sub- 
15 stituting an amino acid residue for a different amino acid 
residue that has different chemical properties. The noncon- 
servative substitutions include, but are not limited to aspartic 
acid (D) being replaced with glycine (G); asparagine (N) 
being replaced with lysine (K); or alanine (A) being replaced 
20 with arginine (R). 

The single-letter codes for amino acid residues include the 
following: A=alanine, R=arginine, N=asparagine, 

D=aspartic acid, C=cysteine, Q=Glutamine, E=Glutamic 
acid, G=glycine, H=histidine, I=isoleucine, L=leucine, 
25 K=lysine, M=methionine, F=phenyl alanine, P=proline, 
S=serine, T=threonine, W=tryptophan, Y=tyrosine, 
V=valine. 

In order that the invention herein described can be more 
fully understood, the following description is set forth. 

The Modified Chaperonin Polypeptides 

In its various aspects, the present invention provides : modi- 
fied chaperonin polypeptides, and fragments and derivatives 
thereof; assembled chaperonins comprising the modified 
chaperonin polypeptides of the present invention; nucleic 
acid molecules encoding the modified chaperonin polypep- 
tides, or fragments and derivatives thereof; recombinant DNA 
molecules; transformed host cells; host-vector systems; 
methods for producing the modified chaperonin polypep- 
40 tides; methods for using the modified chaperonin polypep- 
tides including producing bio-nano scale devices and coat- 
ings. 

In nature, chaperonins are ubiquitous and essential subcel- 
45 lular structures having 14, 16, or 18 heat shock protein sub- 
units (e.g., HSP60), arranged as two stacked rings and mea- 
suring approximately 1 6 to 1 8 nm tall by approximately 1 5 to 
17 nm wide, depending on their species of origin (Hard et al., 
2002 Science 295: 1 852-8). Chaperonins have a central pore, 
5Q as well as apical, intermediate and equatorial domains. In a 
wild-type HSP60 protein subunit, the naturally-occurring N- 
and C-terminal ends occur in a region that forms part of the 
central pore in the assembled chaperonin. FIG. 1 illustrates an 
end and side view of a chaperonin that comprises 1 6 subunits, 
55 i.e., eight subunits per ring. 

The present invention provides chaperonin subunit 
polypeptides, and fragments or derivatives thereof, which are 
modified to include N-terminal and C-terminal ends that are 
relocated from their naturally -occurring positions to different 
60 positions in the polypeptide. 

In the modified chaperonin polypeptide, the N- and C-ter- 
minal ends are relocated from the central pore region to a new 
position on the exterior of the folded modified chaperonin 
polypeptide. The positions of the relocated N- and C-terminal 
65 ends are selected to circumvent the space constraints of the 
central pore, thereby permitting joining or binding other mol- 
ecules to the modified chaperonin polypeptide. 
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The modified chaperonin polypeptides of the present 
invention are mutant chaperonin polypeptides. The modified 
chaperonin polypeptides exhibit the same structural proper- 
ties as naturally-occurring wild-type chaperonin subunit 
polypeptides, which includes folding into a subunit structure 
having the same or similar shape compared to a wild-type 
chaperonin subunit polypeptide. The folded modified chap- 
eronin subunit structure can self-assemble with other chap- 
eronin subunit structures (folded wild-type and/or modified 
subunit polypeptides) to form a double-ringed chaperonin 
structure. The folded modified chaperonin subunit structure 
can self-assemble in vivo or in vitro to form a double-ringed 
chaperonin structure. The double-ringed chaperonin struc- 
ture, comprising at least one modified polypeptide of the 
present invention, can be organized into two-dimensional 
crystals or filaments which are useful for producing nanode- 
vices such as logic and memory devices, non-linear optical 
devices, and sensing devices. 

The present invention provides modified chaperonin 
polypeptides, or fragments or derivatives thereof, derived or 
isolated from any source whether natural, synthetic, semi- 
synthetic, or recombinant. 

The terms “modified chaperonin subunit polypeptides”, 
“modified chaperonin polypeptides”, and “modified polypep- 
tides” refer to chaperonin subunit polypeptides which are 
modified by relocating the N- and C-terminal ends. These 
terms are used interchangeable herein. 

Group I and II Chaperonin Polypeptides 

The modified polypeptides can be produced using any 
Group I or Group II chaperonin polypeptide. Chaperonins 
have been classified into two groups, Group I and Group II, 
based on sequence and structural comparisons. (See, e.g., 
Trent et al., 1991 Nature 354: 490-493; Horwich et al., 1993 
Phil. Trans R. Soc. Lond. 339: 313-326). 

Group I chaperonins are from bacteria and the bacterial- 
derived organelles of Eukarya (mitochondria and chloro- 
plasts), while Group II chaperonins are from Archaea and 
eukaryotic cytosol. A description of the expression of endog- 
enous, wild- type TF55 Sulfolobus shibatae , and a compari- 
son of a Group I chaperonin (GroEL) to the Group II chap- 
eronin TF55 is disclosed by Trent (U.S. Pat. No. 5,428,131). 

Any of the Group I chaperonin subunit proteins can be used 
to produce the modified chaperonin polypeptides of the 
present invention. Wild-type Group I chaperonins are com- 
posed of seven subunits in each of the two rings of the double- 
ring structure. The wild-type cpn60 subunit proteins, which 
comprise about 550 to about 580 amino acid residues, have 
been described by different names in different species, 
including, but not limited to Escherichia coli GroEL protein, 
Cyanobacterial groEL analogues, Mycobacterium tubercu- 
losis and leprae 65 Kd antigen, Coxiella burnetti heat shock 
protein B (gene htpB), Rickettsia tsutsugamushi major anti- 
gen 58, Chlamydial 57 Kd hypersensitivity antigen (gene 
hypB), Chloroplast RuBisCO subunit binding-protein alpha 
and beta chains, Mammalian mitochondrial matrix protein PI 
(mitonin or P60), and yeast HSP60 protein. 

In one embodiment, e.g., when utilizing Group I chaper- 
onins, chaperonin polypeptides, and/or mutant chaperonins 
and/or mutant chaperonin polypeptides, a co-chaperonin can 
be utilized in forming the higher order structures of the inven- 
tion. As such, in one example of such an embodiment, a 
composition or device of the invention further comprises a 
co-chaperonin. Co -chaperonins are well known to those of 
skill in the art (Harris et al., 1995 J. Structural Biol. 115: 
68-77). In another, non-limiting example of such an embodi- 
ment, a co-chaperonin can be utilized in producing nanofila- 
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ments. For example, the cpn60 in the bacterium E. coli 
(GroEL) in nature is associated with a single ring structure 
composed of 10 kDa proteins (co-chaperonin or cpnlO) 
called “GroES.” As such, a GroES polypeptide represents an 
5 exemplary, non-limiting species of co-chaperonin that can be 
utilized in conjunction with Group I chaperonins, e.g., GroEL 
or GroEL-derived chaperonins, chaperonin polypeptides, 
and/or mutant chaperonins or chaperonin polypeptides. In 
different embodiments of the invention, the compositions, 
to e.g., nanotemplates or nano structures , are formed from one or 
more chaperonins with the co-chaperonin on one or both ends 
of the chaperonin. 

Any of the Group II chaperonins subunit proteins can be 
used to produce the modified chaperonin polypeptides of the 
15 present invention. Group II chaperonins are composed of 
identical or diverse subunits arranged in rings of eight or nine 
subunits, depending on the organism. In the yeast Saccharo- 
myces cerevisiae , for example, there is evidence for eight 
different subunits in each ring (Lin et al., 1997 Proc. Natl. 
20 Acad. Sci. USA 94: 107 80- 107 85). Among the Archaea some 
thermophilic methanogens (e.g., Methanopyrus kandleri, 
Methanococcus jannaschii, Methanococcus therm- 
olithotrophicus) have chaperonins with identical subunits 
(Furutani et al., 1998 J. Biol. Chem. 273: 28399-28407), 
25 while in the mesophilic methanogen Methanosarcina ace- 
tivorans there are five different subunits (Galagan et al., 2002 
Genome Research 12: 532-542). Of the 50 archaeal chapero- 
nin sequences in the databases most have >40% amino acid 
sequence identity. 

30 The majority of Group II chaperonins in Archaea have 
eight subunits per ring and are referred to as “thermo somes” 
(Klumpp, M., and Baumeister, W., 1998, FEBS Letters 430: 
73-77), but the chaperonins in the thermoacidophilic Archaea 
in the family Sulfolobales have nine subunits per ring (Trent 
35 et al., 1991 Nature 354: 490-493; Marco et al., 1994 FEBS 
341: 152-155). These Sulfolobus octadecameric chaperonins 
are referred to as “rosettasomes” (Kagawa et al., 1995 J. Mol. 
Biol. 253: 712-725) to distinguish them from thermosomes. 
Other examples of thermo somes include chaperonins from 
40 Pyrodictium occultum, Thermoplasma acidophilum and 
Methanopyrus kandleri (Ellis et al., 1998 J. Struc. Biol. 123: 
30-36). It has previously been reported that rosettasomes are 
composed of two types of HSP60s known as TF55 alpha and 
beta, that TF55 alpha and beta are among the most abundant 
45 proteins in S. shibatae grown at optimal temperatures (75-83° 
C.), and that their synthesis increases at heat-shock tempera- 
tures (85-88° C.) (Kagawa et al., 1995 J. Mol. Biol. 253: 
712-725). A third related subunit of S. shibatae , has also been 
identified by sequence analyses (Archibald et al., 1999 Cur- 
50 rent Biology 9: 1053-1056). Sequence information from S. 
solfataricus (Charlebois et al., 1998 Current Opinion in 
Microbiology 1: 584-588) allowed TF55 alpha, beta, and 
gamma expression to be predicted based on codon usage 
(Karlin et al., 2001 J. Bacteriol. 183: 5025-5040). Chaper- 
55 onins from eukaryotic cytosol are referred to as “TCP1,” 
which identifies one of the proteins comprising the ring struc- 
ture, “TriC” which means TCP1 ring chaperonin, or “CCT” 
which means chaperonin containing TCP1. Any of these 
chaperonins subunit proteins can be used to produce the 
60 modified chaperonin polypeptides of the present invention. 

Sources of gene sequences encoding chaperonin polypep- 
tides include but are not limited to bacterial chaperonin genes 
encoding such proteins as Gro ES/Gro EL; archaeal chapero- 
nin genes encoding such proteins as TF55, TF56, alpha, beta, 
65 gamma, and cpn60s ; mammalian chaperonins such as Hsp60, 
HsplO, TCP-1, cpn60 and the homologues of these chapero- 
nin genes in other species (J. G. Wall and A. Pluckthun, 
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Current Biology, 6:507-516 (1995); Hartl, Nature, 381:571- Chaperonins from the different species can comprise only 
580 (1996)). Additionally, heterologous genomic or cDNA a single type of subunit or they can have different types of 
libraries can be used as libraries to select or screen for chap- subunits (e.g., archeal chaperonins comprising alpha, beta, 
eronins. gamma, etc.). These subunits are called alpha subunits, beta 

FIGS. 2A-2R show protein sequence alignments covering 5 subunits, or gamma subunits, due to some differences in the 
a representative set of Groups I (bacteria) and Group II (ar- protein sequences of the subunits of a given species. As is 
chaea and eukarya) chaperonins. The protein sequence are known to one of ordinary skill in the art. in some species yet 
sequences for 5. shibatae TF55 beta subunit (SEQ ID NO: 1), more varieties of subunits exist. The structure of chaperonins 
bacterial E. coli GroEL (SEQ ID NO:2), thermosome T. aci- ( Ellls et aL ’ 1998 ’ J - Struc - BloL 123 > 30 ; 36 ) describes a 
dophilum beta subunit (SEQ ID NO:3), cyanobacteria] syn- 10 chaperonin from Sulfolobus solfataricus with a 2:1 ratio of 
echococcus HSP60 (SEQ ID NO:4), M. acetivorans HSP60-4 alpha:beta subunit composition of the nine-membered ring 
(SEQ ID NO:5), M. tuberculosis HSP65 (SEQ ID NO:6), (rosettasomes). The present invention provides means of 
thermosome A. pernix alpha subunit (SEQ ID NO:7), ther- assembling chaperonins from only a single type of wild-type 
mosome M. mazei alpha subunit (SEQ ID NO:8), mitochon- or modified chaperonin polypeptides, or from various propor- 
drial A. thaliana HSP60 (SEQ ID NO:9), yeast TCP1 alpha 15 tlons of . the different wild-type or modified chaperonin 
subunit (SEQ ID NO: 1 0), human mitochondria] HSP60 (SEQ polypeptides. 

ID NO: 1 1 ), mouse mitochondrial HSP60 (SEQ ID NO: 1 2), In a specific embodiment, HSP60s (heat-shock proteins) in 

human TCP1 alpha subunit (SEQ ID NO: 13), mouse TCP1 organisms living at high temperatures, called “thermophiles,” 
alpha subunit (SEQ ID NO: 14), and the consensus (SEQ ID are the source of the wild-type and mutated chaperomn 
NO: 15). White letters on a black background, solid lines, and 20 polypeptides of the present invention. These proteins are 
dashed lines surround the regions of the sequence alignment present in all organisms and are among the most abundant 

containing identical residues, a block of similar residues, and proteins in extreme thermophiles, e.g., in one of the highest 
conservative matches respectively temperature thermophiles Pyrodictium occultum , they report- 

edly account for 73% of total protein (Phipps et al., 1991 The 
Folded Structure 25 EMBO Journal 10(7): 1711-1722). 

While Group I chaperonin subunit proteins can have Selecting the Position of Relocated Terminal Ends 
greater than 50% sequence identity, sequence identity among Using the amino acid sequence and/or the predicted folded 

Group II chaperonin subunit proteins can be on the order of structure of any Group I or II wild-type chaperonin subunit 

less than 33%. Despite the sequence variations among the polypeptide as a guide (FIGS. 2A-R and 15 ), one skilled in the 
cpn60 subunits from the different species, however, Group I 30 ^ can the , |mino acid position to relocate the N- and 
and Group II cpn60 subunits share significant structural sum- C-terminal ends. The new terminal end positions can he 
lanty. FIG. 3 shows a structural comparison between a sub- selected based on the following criteria: the position is not 
unit of the archaeal ( Thermoplasma acidophilum ) thermo- buried within the folded modified chaperonin polypeptide; 
some and the bacterial (E. coli ) GroEL chaperonins. the new position is not near the subunit interface; and the new 

The alignment was performed using an algorithm based on 35 position is not part of the regular secondary structure, 
the iterative dynamic programming approach (Gerstein, M. & In one embodiment, the N-terminal and C-termin a l ends 

Levitt, M., Protein Science 7: 445-456, 1998; and Gerrstein, are relocated to any position on the exterior of the folded 

M. & Levitt, M, Proc. of ISMB-96, pp. 59-67, 1996). polypeptide that occurs in the apical, intermediate or equato- 

For purposes of wild-type chaperonins and chaperonin 4Q rial domain, 
polypeptides, such three dimensional structural similarity In another embodiment, any chaperonin subunit polypep- 

serves to illustrate that fact that any chaperonin or chaperonin tide can be modified by relocating the N- and C-terminal ends 
polypeptide routinely can be utilized as part of the composi- to produce the modified polypeptides of the present invention, 
tions and devices of the present invention, either alone or including: TF55 alpha, beta or gamma subunits from Sulfolo- 
combination. For purposes of modified chaperonins and 45 bales shibatae. For example, the N- and C-terminal ends can 
chaperonin polypeptides, as discussed in detail in the next be relocated to any position 1-560 for TF55 alpha subunit, 
section, such sequence similarity serves to provide teaching position 1-552 for TF55 beta subunit, or position 1-535 for 
that allows for routine manipulation of sequences in produc- TF55 gamma subunit (FIG. 15 ). 

ing the modified chaperonin polypeptides of the present In another embodiment, using the amino acid sequence of 

invention. wild-type TF55 beta subunit from Sulfolobus shibatae as a 

The two subunits exhibit very similar structures, in that guide, the N- and C-terminal ends are relocated to any amino 
both possess an equitorial, an intermediate and an apical acid Positions within or about a region having a consensus 

region. Even though these two examples of cpn60 subunits sequence, a conserved folded structure, or non-conserved 

are farther apart by sequence than most cpn60 subunits, as sequence (FIG. 15 ). Many positions can be modified. In one 
evidenced by the very little similarity in their sequence align- 55 embodiment, the N- and C-terminal ends can be relocated to 
ments (see FIGS. 2A-2R), the crystal structures for each amino acid positions within or about the ranges: 149-158; 
reveal that they share considerable structural identity — most 263-270; 311-330; 472-487; or 494-508. In yet another 

all helical, sheet, and random coil regions correspond, as embodiment, the N- and C-terminal ends are relocated to 

shown in black in the center panel. Variations in structure are amino acid positions 153, 267, 316, 480 or 499, based on 

tolerated in the apical domain, as evidenced by the loop of the 60 wild-type HSP60 beta subunit from Sulfolobus shibatae. 
thermo some, while the equatorial domains adopt similar con- Linker Sequences 

served folding motifs. Xhe modified chaperonin polypeptides also comprise the 

It is noted that, while the chaperonins observed to date naturally -occurring N- and C-terminal ends linked directly 
comprise seven, eight or nine subunits per ring, the present together or linked with an intervening linker sequence. The 
invention provides methods and compositions of exploiting 65 intervening linker does not interfere with the ability of the 
chaperonins with any number of subunits per ring (e.g., 7, 8, modified chaperonin polypeptide to fold and assemble into 
9 or 10). double-ringed chaperonin structures. In the modified chap- 
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eronin polypeptide, the intervening linker can maintain the 
same or similar spatial organization or distance as that found 
in a folded wild type chaperonin polypeptide. The intervening 
linker can be a flexible or helical linker sequence (Arai, et al., 
2001 Protein Engineering 14:529-532; Waldo, et al., 1999 5 
Nature Biotechnology 17:691-695). The intervening linker 
sequence comprises between 1 to about 10 amino acid resi- 
dues or derivatives thereof. The intervening linker sequence 
comprises amino acid residues such as glycine, serine, ala- 
nine and/or threonine, or derivatives thereof, in any combi- to 
nation and in any order. In principle, the linker sequence can 
include any amino acid residue, however, large and/or hydro- 
phobic residues are more likely to cause problems. In one 
embodiment, the linker sequence comprises the amino acid 
sequence Gly-Gly-Ser-Gly-Gly-Thr (SEQ ID NO:64). In 15 
another embodiment, the modified chaperonin polypeptide 
comprises an additional linker sequence. For example, the 
chaperonin-EYFP fusion protein which is modified at posi- 
tion 267 comprises an additional linker comprising the 
sequence Gly-Ser-Gly-Gly-Ser-Gly (SEQ ID NO: 83) which 20 
joins the yellow fluorescent protein to the chaperonin protein 
(FIG. 26B). In another example, the chaperonin-EYFP fusion 
protein which is modified at position 480 comprises an addi- 
tional linker comprising the sequence Gly-Ser-Gly-Gly-Ser- 
Gly-Ser-Gly (SEQ ID NO:84) which joins the yellow fluo- 25 
rescent protein to the chaperonin protein (FIG. 27B). 


-continued 
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ID 
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Whaley et al. also describes amino-acid sequences that 
bind silicon and not silicon dioxide. An example of an amino- 
acid sequence that binds to ZnS( 102) (Leeet al., 2002 Science 
296: 892-895) is: 

CNNPMHQNC (SEQ ID NO: 27) 


Joined or Bound Molecules 

In one embodiment, the modified chaperonin polypeptides 
comprise relocated N- and C-terminal ends that are 
covalently joined with an organic molecule, or an inorganic 
molecule or compound. 

In a different embodiment, the modified chaperonin 
polypeptides comprise relocated N- and C-terminal ends that 
bind non-covalently with an organic molecule, or an inor- 
ganic molecule or compound. 

The organic molecule or inorganic molecule/compound is 
joined or bound to the modified polypeptide in such a way that 
it does not disrupt the structure of the modified polypeptide. 
The modified polypeptide can fold into a structure that 
resembles a wild-type folded chaperonin subunit polypep- 
tide. The folded modified polypeptide can self-assemble into 
a double-ringed chaperonin structure. The assembled chap- 
eronin structure can be organized into nanodevices or coat- 
ings. 

The organic molecule can be a second polypeptide mol- 
ecule, including but not limited to a: protein; peptide frag- 
ment; reporter molecule; metal-binding peptide; enzyme; 
hormone; growth factor; trophic factor; antibody; antigen; 
receptor; toxin; fluorescent protein; or luminescent protein. 

The second polypeptide can be from any organism includ- 
ing bacterial, viral, mammalian (e.g., bovine, porcine, 
murine, equine, canine, feline, monkey, ape, ovine or human), 
piscine, avian or insect. 

The second polypeptide includes metal-binding peptide 
sequences that bind selectively to specific faces of gallium 
arsenide, silicon or indium phosphide. An exemplary, non- 
limiting list of partial amino-acid sequences from clones that 
bind to different surfaces of GaAs and/or InP (Whaley et al., 
2000 Nature 405: 665-668) includes: 


A list of partial amino-acid sequences from clones that 
bind to Ag (Naik et al., 2002 Nature Materials 1: 169-172) 
30 includes: 

AYSSGAPPMPPF (SEQ ID NO: 2 8) 

NPSSLFRYLPSD (SEQ ID NO: 29) 

35 

SLATQPPRTPPV (SEQ ID NO: 3 0) 

A list of partial amino acid sequences from clones that bind 
to Au (Brown et al., 2000 J. Mol. Biol. 299, 725-735; Brown, 
40 S., 1997, Nature Biotechnol. 15, 269-272) includes: 

MHGKTQATSGTIQS (SEQ. ID. NO: 31) 

ALVPTAHRLDGNMH (SEQ. ID. NO: 32) 

45 

Further examples of peptide sequences that bind inorganic 
molecules or compounds have been discovered using bacte- 
rial cell surface or phage display procedures, as reviewed by 
5Q Sarikaya (2003 Nature Materials 2:577-7585). 

A list of amino acid sequences from clones that bind to Au 
(Sarikaya, et al., 2003 Nature Materials 2:577-585) includes: 

55 SKTSLGQSGASLQGSEKLTNG (SEQ ID NO: 4 0) 

QAT S E KLVRGMEGAS LHP AKT (SEQ ID NO: 41) 

A list of amino acid sequences from clones that bind to Pt 
(Sarikaya, et al., 2003 Nature Materials 2:577-585) includes: 


VTSPDSTTGAMA (SEQ ID NO: 16) 

AASPTQSMSQAP (SEQ ID NO: 17) 


65 


DRTSTWR (SEQ ID NO: 42) 

QSVTSTK (SEQ ID NO: 43) 


AQNPSDNNTHTH 


(SEQ ID NO: 18) 


SSSHLNK 


(SEQ ID NO: 44) 
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A list of amino acid sequences from clones that bind to Pd A list of an amino acid sequences from clones that bind to 
(Sarikaya, et al., 2003 Nature Materials 2:577-585) includes: ZnS (Sarikaya, et al., 2003 Nature Materials 2:577-585) 

includes: 


SVTQNKY (SEQ ID NO: 45) 

SPHPGPY (SEQ ID NO: 46) 


NNPMHQN (SEQ ID NO: 63) 


HAPTPML (SEQ ID NO: 47) 

A list of amino acid sequences from clones that bind to 
Si0 2 (Sarikaya, et ah, 2003 Nature Materials 2:577-585) 
includes: 


MS PHPHPRHHHT 
RGRRRRLSCRLL 
KPSHHHHHTGAN 


(SEQ ID NO: 48) 
(SEQ ID NO: 49) 
(SEQ ID NO: 50) 
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A list of amino acid sequences from clones that bind to 
zeolites (Sarikaya, et ah, 2003 Nature Materials 2:577-585) 20 
includes: 


VKTQATSREEPPRLPSKHRPG (SEQ ID NO: 51) 

MDHGKYRQKQAT PG (SEQ ID NO: 52) 

A list of amino acid sequences from clones that bind to 
ZnO (Sarikaya, et ah, 2003 Nature Materials 2:577-585) 
includes: 


25 


30 


NTRMTARQHRSANHKS TQRA (SEQ ID NO: 53) 

YDSRSMRPH (SEQ ID NO: 54) 

A list of amino acid sequences from clones that bind to 
CaC0 3 (Sarikaya, et ah, 2003 Nature Materials 2:577-585) 
includes: 


HTQNMRMYEPWF (SEQ ID NO: 55) 

DVFSSFNLKHMR (SEQ ID NO: 56) 

A list of amino acid sequences from clones that bind to 
Cr 2 0 3 (Sarikaya, et ah, 2003 Nature Materials 2:577-585) 
includes: 


WRPKAATN (SEQ ID NO: 57) 

RIRHRLVGQ (SEQ ID NO: 58) 50 

A list of an amino acid sequence from clones that bind to 
Fe 2 0 3 (Sarikaya, et ah, 2003 Nature Materials 2:577-585) 
includes: 


55 


RRTVKHHVN (SEQ ID NO: 59) 

A list of amino acid sequences from clones that bind to 
GaAs (Sarikaya, et ah, 2003 Nature Materials 2:577-585) 
includes: 60 


AQNPSDNNTTH 

RLELAIPLQGSG 

TPPRPIQYNHTS 


(SEQ ID NO: 60) 
(SEQ ID NO: 61) 
(SEQ ID NO: 62) 
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The second polypeptide includes a reporter molecule, such 
as yellow fluorescent protein (EYFP) (SEQ M NO: 9 1), green 
fluorescent protein (GFP), red fluorescent protein (RFP), DS 
red from coral, auto -fluorescent proteins including blue fluo- 
rescent protein (BFP), cerulean fluorescent protein (CFP), 
luciferase, glutathione-S -transferase (GST), horseradish per- 
oxidase (HRP), chloramphenicol acetyltransferase (CAT) 
beta-galactosidase, beta-glucuronidase, and any derivatives 
of these reporter molecules (Labas, 2002 Proc Natl Acad Sci 
USA 99:4256-4261; Shaner 2004 Nature Biotechnology 
22:1567-1572; Patterson 2001 Cell Science 114: 837-838; 
Wei Wen Su 2005 Microbiol Cell Factories 4:12; Rizzo 2004 
Nature Biotechnology 22:445-449; Griesbeck 2001 Journal 
Biological Chemistry 276:29188-29194). 

The second polypeptide includes an epitope tag, such as 
histidine (His) tags, or V5 tags, FLAG tags, influenza hemag- 
glutinin (HA) tags, Myc tags, VSV-G tags, or thioredoxin 
(Trx) tags. The tagged-fusion molecules are useful for facili- 
tating isolation and/or purification of the modified chaperonin 
polypeptides (Marshak, D. R., et ah, 1996 in: “Strategies for 
Protein Purification and Characterization” pp 396). 

The second polypeptide can include maltose binding pro- 
tein (MBP), S-tag, Lex A DNA binding domain (DBD) 
fusions, GAL4 DNA binding domain fusions, and herpes 
simplex virus (HSV) BP 16 protein. 

The second polypeptide can be engineered to include a 
cleavage site located between the modified chaperonin 
polypeptide and the second polypeptide so that the modified 
chaperonin polypeptide can be cleaved and purified away 
from the second polypeptide. The cleavage site can include 
recognition sequences for the following enzymes: enteroki- 
nase, corin, MT-SP/matryptase, trypsin, chymotrypsin, 
human airway trypsin-like protease (HAT), mast cell 
tryptase, elastase, plasmin, kallikrein, TMPRSS2, MBL-as- 
sociated serine proteases (MASP- 1 and MASP-2), Stubble- 
stubbloid, furin, thrombin or factor Xa. 

The organic molecule can be a nucleic acid molecule, 
including but not limited to: DNA; RNA; DNA/RNA hybrids; 
or derivatized nucleic acid molecules. The organic molecule 
can be a lipid or a carbohydrate. 

The inorganic molecule or compound is a nanoparticle 
which includes: gallium arsenide; silicon; silicon dioxide; 
indium phosphide; ZnS; gold; silver; CdSe — ZnS; CdS; 
CdSe; InP; InGaAs; CuCl; InAs quantum dots; silicon nanoc- 
rystals and nanopyramids; silver nanoparticles; magnetic 
quantum dots (e.g., nanomagnets, such as CoCu, FeCu, NiFe/ 
Ag, and CoAg); transition metals (e.g., gold, silver, zinc, 
cadmium, platinum, palladium, cobalt, mercury or nickel); 
alkali or alkaline earth metals (e.g., sodium, potassium, cal- 
cium or cesium); group III elements (e.g., aluminum, gallium 
or indium); group IV elements (e.g., silicon, germanium, tin 
or lead); group V elements (e.g., phosphorous, arsenic, anti- 
mony, or bismuth); or group VI elements (e.g., sulfur, sele- 
nium or tellurium). The inorganic molecules or compounds 
also include any given combination, such as III-V compounds 
like GaAs or AlGaAs. The inorganic molecules or com- 
pounds also include: a fullerene; a carbon nanotube; or a 
dielectric, polymeric, or semiconducting nanoparticle. 
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Amino Acid Analogs and Altered Polypeptides 

The present invention further provides modified chapero- 
nin polypeptides, or fragments or derivatives thereof, com- 
prising amino acid analogs. The amino acid analogs can be 
chemically synthesized, and include dextro or levo forms, or 
peptidomimetic s . 

The modified chaperonin polypeptides can include any 
combination of amino acids from natural and/or or non-natu- 
ral sources. The amino acid residues can include L-amino 
acids and/or D-amino acids. The amino acid residues can 
include rare amino acids such as 4-hydroxyproline or 
hydroxy lysine. The amino acid can include organic acids or 
amides. The amino acid can include derivatives of common 
amino acids, such as amino acids having the C-terminal car- 
boxylate esterified (e.g., benzyl, methyl or ethyl ester) or 
amidated and/or having modifications of the N-terminal 
amino group (e.g., acetylation or alkoxycarbonylation). The 
amino acids can include side-chain modifications and/or sub- 
stitutions (e.g., methylation, benzylation, t-butylation, tosy- 
lation, alkoxycarbonylation, and the like). The amino acids 
can include derivatives having an N-acetyl group such that the 
amino group that represents the N-terminus of the linear 
peptide prior to cyclization is acetylated, and/or a C-terminal 
amide group (i.e., the carboxy terminus of the linear peptide 
prior to cyclization is amidated). The amino acids can 
include, but are not limited to, penicillamine, beta-beta-tet- 
ramethylene cysteine, beta-beta-pentamethylene cysteine, 
beta-mercaptopropionic acid, beta-beta-pentamethylene- 
beta-mercaptopropionic acid, 2-mercaptobenzene, 2-mer- 
captoaniline, 2-mercaptoproline, ornithine, diaminobutyric 
acid, alpha-aminoadipic acid, m-aminomethylbenzoic acid, 
and alpha-beta-diaminopropionic acid. 

The present invention also provides modified chaperonin 
polypeptides which are altered, for example, by post-transla- 
tional pathways or by chemical synthesis, including N- or 
O-glycosylated amino acid residues. The N-terminal end of 
the polypeptides can be altered to include acylated or alky- 
lated residues. The C-terminal end of the polypeptides can be 
altered to include esterified or amidated residues. The non- 
terminal amino acid residues can be altered, including but not 
limited to, alterations of the amino acids, arginine, aspartic 
acid, asparagine, proline, glutamic acid, lysine, serine, threo- 
nine, tyrosine, histidine, and cysteine. 

Sequence Variants 

The present invention provides modified chaperonin 
polypeptides, or fragments or derivatives thereof, comprising 
sequence variations of any naturally-occurring chaperonin 
polypeptide (FIG. 2A-R, 15, 16, or 28B). As persons skilled 
in the art understand, any number of amino acids can be 
varied alone, or in combination with other amino acids and 
yet the modified chaperonin polypeptides will retain their 
functional activity (e.g., folding into a subunit structure that 
self-assembles into a double-ringed chaperonin structure). 
Sequence variants of the modified chaperonin polypeptides 
include: amino acid substitutions, amino acid insertions, 
amino acid deletions, mutant forms, allelic forms, and 
homologs. 

The sequence variants also includes chaperonin polypep- 
tides from naturally-occurring or laboratory strains, where 
the amino acid sequence of the chaperonin polypeptide dif- 
fers from that of any of the Group I or II chaperonin polypep- 
tides disclosed herein (see also FIGS. 2A-R, 15, 16 and 
21-29) 

Amino Acid Substitutions 

The modified chaperonin polypeptides, or fragments or 
derivatives thereof, can include amino acid substitutions, 
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including conservative and/or non-conservative amino acid 
substitutions. Guidance in determining which and how many 
amino acid residues can be substituted in the modified chap- 
eronin polypeptides can be found in the properties of a natu- 
5 rally-occurring, chaperonin polypeptide. These properties 
include the amino acid length, the physical length, the folded 
conformation, or the ability to self-assemble into a chapero- 
nin. These properties can be derived by prediction (e.g. , based 
on amino acid sequence) and/or experiment (e.g., based on 
10 X-ray crystallography or transmission electron microscopy 
(TEM)). The substituted amino acids are selected so that the 
properties of the variant, modified chaperonin polypeptides is 
identical or similar to that of a naturally-occurring chaperonin 
1 subunit polypeptides . In one embodiment, the modified chap - 
eronin polypeptides comprise a cysteine substituted with 
another amino acid. For example the cysteine at position 258 
is substituted with alanine, as shown in FIG. 28B. 

Mutant Forms 

20 

The present invention also provides modified chaperonin 
polypeptides, or fragments or derivatives thereof, having a 
mutation. The mutant variant has an amino sequence that 
differs from that of the wild-type, naturally-occurring chap- 
eronin polypeptide. The mutations include amino acid sub- 
stitutions, deletions, insertions, additions, truncations, or pro- 
cessing or cleavage errors of the protein. The mutant variants 
can have the same or similar functional activity of a wild- type 
chaperonin polypeptide. 

30 In one embodiment, the modified chaperonin polypeptides 
comprise deletions of up to 10, 20, 30, 40, 50, 60 or 70 amino 
acids from the N- or C-terminal end. In another embodiment, 
the modified chaperonin polypeptides comprise additional 
deleted amino acids. For example, a “dwarf’ modified chap- 
35 eronin polypeptide (modified at position 267) comprises an 
additional 52 amino acids deleted from the N-terminal end of 
(e.g., FIG. 29B) compared to the 267 ‘non-dwarf’ modified 
polypeptide (FIG. 22B). These deletion variants retain the 
ability to fold and assemble into double-ringed chaperonin 
40 structures. FIG. 29A shows the corresponding DNA 
sequence. 

Designing Modified Chaperonin Polypeptides 

The present invention provides methods for producing 
45 modified chaperonin polypeptides through selective mutation 
of the polypeptide, and then exploiting the ability of these 
variants to self-assemble into higher-order structures under 
various conditions for forming the compositions and devices, 
e.g., nanotemplates, nanostructures, nanoarrays and nanode- 
5Q vices, of the invention. 

The compositions and devices of the invention, e.g., the 
nanotemplates, nanostructures, nanoarrays and nanodevices 
of the invention, comprise, unless otherwise indicated, at least 
one mutant chaperonin, which comprises at least one mutant 
55 chaperonin polypeptide. Non-limiting examples of mutant 
chaperonins and mutant chaperonin polypeptides that can be 
utilized as part of the methods and compositions of the 
present invention are described herein. 

In referring to mutant chaperonins and mutant chaperonin 
60 polypeptides, the term “mutant” refers to a difference relative 
to what is considered a wild-type sequence. Representative, 
non-limiting examples of wild-type chaperonin polypeptide 
sequences are presented in FIGS. 2A-2R. Generally, a mutant 
chaperonin sequence refers to a sequence that does not occur 
65 in nature at a greater than 10% (±10%) allelic frequency, as 
measured by standard methods and available data. For 
example, an example of a mutant S. shibatae chaperonin 
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polypeptide is one that is expressed by an allele that is present 
in the organism at no greater than 10% (±10%) allelic fre- 
quency. 

It is appreciated by those skilled in the art that the sequence 
and three dimensional structural similarities between chaper- 5 
onins and chaperonin polypeptides from different organisms 
can be used as a guide to utilize any chaperonin polypeptide 
to produce the modified chaperonin polypeptides and 
assembled structures of the present invention. The sequence 
and three dimensional structural similarity of the subunits 10 
among the different types of chaperoning, which is illustrated 
by the sequence alignment depicted in FIGS. 2A-2R and the 
structural overlap as illustrated in a representative compari- 
son depicted in FIG. 3, provides the basis for the production 
of the modified chaperonin polypeptides and assembled 15 
chaperonins of the invention. 

Further, the details of the structure of chaperonins can be 
solved at atomic -resolution (2.3-2. 8 A) (See, e.g., FIG. 1, and 
Xu, Z. et al., 1997 Nature 388: 741-750; and Ditzel, L., J. 
Lowe, et al., 1998 Cell 93: 125-138). This provides detailed 20 
information about the location of every atom of every amino 
acid in the double ring structure (e.g., FIG. 4 ), and can be used 
to routinely select positions for modification and can rou- 
tinely assess the properties of the modified chaperoning. 

Utilizing the sequence and three dimensional structural 25 
similarities among chaperonins and chaperonin polypeptides, 
as well as the ability to solve at atomic-resolution the struc- 
ture of particular chaperonins and chaperonin polypeptides, 
the structure of the chaperonin polypeptides can be manipu- 
lated to influence, for example, their folding, assembly, 30 
strength, and binding properties, as well as the assembly, 
strength and binding properties of the resulting chaperoning. 

Such structural similarities can be utilized in a number of 
different ways in designing appropriate mutants. For 
example, a mutant in one species that exhibits a desirable 35 
characteristic can be introduced into a corresponding position 
in another chaperonin by utilizing the sequence similarity 
and/or the three dimensional structural similarity between the 
chaperoning. In one such embodiment, for example, the 
mutant S. shibatae sequences successfully utilized in the 40 
examples presented below can routinely be introduced into 
other chaperonin polypeptides by these techniques. 

Standard methods well known in the art which allow 
changing specific amino acids in chaperonin polypeptides, 
such as the method of site-directed mutagenesis, regions of 45 
the subunits can be modified, and the resulting chaperonin 
polypeptides can routinely be tested for their ability to pro- 
duce chaperonins and, for example, nanotemplates, nano- 
structures, nanoarrays and nanodevices, e.g., their ability to 
assemble into tubes and filaments or two-dimensional crys- 50 
tals can be tested. In one embodiment, for example, amino 
acid tails can be attached to chaperonin polypeptide subunits 
that do not inhibit their ability to assemble into rings and 
tubes, and that allow the binding of various nanoscale mate- 
rials, such as metals, at various locations of the chaperonins, 55 
including inside the chaperonin structure. In one embodi- 
ment, one of the three HSP60 subunits (beta) from Sulfolobus 
shibatae , an organism that lives in geothermal hot-springs 
and grows at temperatures of up to 85° C./pH 2.0 is used to 
form mutant chaperonins. The chaperonins in S. shibatae are 60 
octadecameric with nine subunits per ring. FIG. 15 shows the 
protein sequence alignment of S. shibatae TF55 alpha subunit 
(SEQ ID NO: 39), beta subunit (SEQ ID NO: 1) and gamma 
subunit (SEQ ID NO: 38). The beta subunit can be chosen for 
a particular application based on such factors as its thermo- 65 
stability, which makes it easy to purify as a recombinant 
protein, and its propensity to form desirable structures. 
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In general, the chaperonin subunits have many regions that 
can accommodate relocated N- and C-terminal ends in each 
of their three domains-equatorial, intermediate, and apical 
domains, as illustrated in FIG. 3. FIG. 4 shows the detailed 
structure of a Group II chaperonin subunit that can be used in 
selecting the relocated terminal positions. 

Other modified chaperonin polypeptides have been previ- 
ously described. These modified chaperonin polypeptides 
comprise mutations having single cysteine residues placed at 
various solvent-exposed sites on the polypeptide. The thiol 
moiety of these cysteine residues provides binding sites for 
soft metals (see PCT/US02/35889). These mutations also 
change the dimensions of the resulting chaperonin subunit, 
such as length, inner pore diameter, outer diameter, and the 
like. The mutant chaperonin polypeptides described in PCT/ 
US02/35889 can be used as a guide to select the positions of 
the relocated N- and C-terminal ends in the modified chap- 
eronin polypeptides of the present invention. The modified 
chaperonin polypeptides of the present invention can use any 
of the mutant chaperonin polypeptides described in PCT/ 
US02/35889 as a basis, and relocate the N- and C-terminal 
ends therein. 

The choice of mutation may also depend on the desired 
structure for the different applications of the present inven- 
tion, including the formation of nanotemplates, nanostruc- 
tures, nanoarrays and nanodevices. 

The choice of mutations to make depends on the desired 
structure of the resulting chaperonin, and can routinely be 
ascertained. In a specific embodiment, the mutated chapero- 
nin polypeptide subunits include ones that assemble into 
higher order structures with less than seven subunits per ring 
or more than nine subunits per ring, such as six to 10 subunits 
per ring. Mutations can be made to the subunit sequence such 
that the resulting subunit variants assemble into a structure 
with any number of subunits per ring. Mutations introduced 
that that change in number of subunits per ring can, for 
example, be used to modify the diameter of a resulting ring 
nanostructure. 

Factors that affect the choice of which chaperonin polypep- 
tides to manipulate (e.g., from what species, which sub- 
unit(s), etc.), and what mutations are to be made to them, 
include the desired dimensions, i.e., length, pore diameter, 
and outer diameter, of the resulting chaperonin product, or 
introduction of a selective binding site anywhere on the 
polypeptide. The subunits of both Group I and Group II 
chaperonins will tolerate a point mutation at almost any posi- 
tion. When sequence alignments are used in determining 
mutation positions, mutations at similar, non-identical resi- 
dues, as determined by sequence alignment, being preferred, 
and non-conserved positions, as determined by sequence 
alignment being more preferred. When three dimensional 
structural alignments are used in determining mutation posi- 
tions, a structural alignment of chaperonin subunits, such as 
that of FIG. 3, can serve as a guide in deciding where on the 
subunit to perform the mutation. The loops and turns from the 
two structures that do not directly superimpose can be choices 
of points to perform mutations, including deletions and inser- 
tions. The amino acid sequence alignment of various chaper- 
onins can also be used in determining the mutation positions. 
For example, FIG. 15 depicts sequence alignment of TF55 
alpha, beta and gamma subunits from S. shibatae. In the 
modified chaperonin polypeptides of the present invention, 
the N- and C-terminal ends can be relocated to any position 
1-560 for TF55 alpha subunit, position 1-552 for TF55 beta 
subunit, or position 1-535 forTF55 gamma subunit (FIG. 15 ). 

In one embodiment, a choice of deletion of the amino acid 
loop at the apical domain of a group II chaperonin is made 
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through comparison of the structural alignment of FIG. 3, and 
with the observation that the loopless group I chaperonin 
subunit assembles into the double-ring structure of the chap- 
eronin. In another embodiment, the N- or C-terminus is 
removed. In yet another embodiment, the N- or C-terminus is 
modified by inserting a sequence. The sequence can be 
inserted for binding specificity, such as by introducing cys- 
teine or tyrosine which can be modified chemically. 

In one embodiment, the mutant chaperonin comprises one 
more mutated chaperonin polypeptide sequences with one or 
more point mutations. An exemplary point mutation in TF55- 
beta from Sulfolobus shibatae results from residue 298 being 
changed from cysteine to alanine and residue 270 changed 
from glutamine to cysteine (see PCT/US02/35889). In 
another embodiment, the mutant chaperonin comprises one 
more mutated chaperonin polypeptide sequences with one or 
more sequences deleted. An exemplary deletion inTF55-beta 
results from Sulfolobus shibatae with residues 254 to 281 
deleted (see PCT/US02/35889). In another embodiment, the 
mutant chaperonin comprises one or more mutated chapero- 
nin polypeptide sequences with one or more polypeptide 
sequences inserted. An exemplary insertion in TF55-beta 
results from Sulfolobus shibatae with peptides that possess 
binding specificity inserted. As discussed above, correspond- 
ing mutations can be routinely introduced into any other 
chaperonin polypeptide. 

In another embodiment, the peptides are designed to bind 
nanoscale materials such as nanoparticles and quantum dots. 
In yet another embodiment, the peptides are designed to bind 
only to specific surfaces. Still other modifications can also be 
made in the equatorial domains that include deletions, sub- 
stitutions and additions to the N- and C-termini with little 
effect on the formation of chaperonins or nanotemplates such 
as filaments. For example, up to about 5, 10, 15, 20, 25, or 30 
amino acids of the N- and/or C-terminus of the chaperonin 
polypeptide can be modified, e.g., deleted. For example, 
GroEL can be modified by removing up to about 27 amino 
acids from the C-terminus without impairing its ability to 
assemble into double rings. 

Additional references that describe possible mutations of 
specific residues of the polypeptides are contained in the 
review article by Fenton (1997 Protein Science 6: 743-760). 

The sequence alignment of FIGS. 2A-2R indicates that the 
regions that have been manipulated in S. shibatae also exist in 
other species. Whatever mutations have been successfully 
made in one species may be successfully others species, 
whether bacterial, other archea or eukarya. The correspond- 
ing regions of the sequence alignments can therefore serve as 
a guide in choice of manipulations to produce variants in 
other species, combined with the knowledge of the region of 
the chaperonin subunit that the given mutated sequence is 
located. A successful mutation of the chaperonin polypeptide 
from any given species is indicated if the mutated chaperonin 
polypeptide retains its ability to assemble into the higher 
order structures of the invention, including the nanotem- 
plates, nanostructures, nanoarrays and nanodevices. 

In a specific embodiment, guided by structural informa- 
tion, the beta subunit of Sulfolobus shibatae is genetically 
modified to add chemically reactive sites without destroying 
its ability to assemble into chaperonins and 2D crystals. 
While a detailed three-dimensional structure of S. shibatae 
beta is not known, X-ray structures for homologous chapero- 
nin subunits are known (See, e.g., Xu et al. And Diztel et al., 
supra.). Detailed transmission electron microscopic (TEM) 
analyses of S. shibatae chaperonins have also been reported 
(Trent et al., 1997 Proc. Nat. Acad. Sci 94: 5383-5388). Using 
X-ray structures of homologous subunits and TEM analyses 
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of Sulfolobus chaperonins, a hypothetical three-dimensional 
model for the beta chaperon can be produced, and used to 
guide genetic manipulations (See, e.g., Peitsch, M. C., 1995 
Bio/Technology 13: 658; Guex, N., Peitsch, M. C., 1997 
5 Electrophoresis 18: 2714; Guex, N., Diemand, A., Peitsch, M. 
C., 1999 TiBS 24: 364). At least two classes of beta mutants 
can be created using site-directed mutagenesis, many of 
which retain their ability to assemble into chaperonins that 
form 2D crystals (FIGS. 7B and 7D). 
l o The present invention provides at least five different modi - 

fied chaperonin polypeptides which retain the ability to 
assemble into chaperonin structures. These modified 
polypeptides can be used as a guide to other modified chap- 
eronin polypeptides. 

15 In two classes of beta mutants of S. shibatae , the single 
native cysteine residue in beta can be changed to a nonreac- 
tive residue, for example, an alanine residue, e.g., to prevent 
potential issues with folding and with assembly of mutant 
subunits. A cysteine can then placed at different solvent- 
20 exposed sites. The thiols of these cysteines can provide bind- 
ing sites for soft metals including gold and zinc (see PCT/ 
US02/35889). In one class of beta mutants of S. shibatae , the 
exposed cysteine is placed near the tip of a 28 amino acid loop 
on the apical domain of beta, which in the assembled chap- 
25 eronin protrudes into the central cavity. This mutant chapero- 
nin has a ring of reactive thiols with a diameter of approxi- 
mately 3 nm on both ends (FIG. 7A). In the other class of beta 
mutants of S. shibatae , the protruding 28 amino acid loop was 
removed and placed the exposed cysteine on the apical 
30 domain itself. The mutant chaperonin assembled from this 
subunit has a ring of reactive thiols with a diameter of 
approximately 9 nm and an open pore into its central cavity 
(FIGS. 7D, 7E). 

The beta subunit of S. shibatae proves to have sufficient 
35 structural plasticity in its apical domain to accommodate both 
the amino acid substitutions and deletions can be made with- 
out loss of its ability to form chaperonins and 2D crystals. 
Under reducing conditions both classes of beta mutants 
formed chaperonins that assembled into disk-shaped, hex- 
40 agonally packed 2D crystals up to 20 pm in diameter (FIGS. 
7B and 7D), the crystalline lattice ordering of which is con- 
firmed by fast Fourier transformation (FFT) of the TEM 
images FIG. 7D, inset). 

With knowledge of the sequences of the group I or group II 
45 chaperonin polypeptide, any number of mutations can be 
judiciously placed at one or more areas of the apical, inter- 
mediate and/or equatorial domains of the chaperonin 
polypeptide. As evidenced by the sequence alignment of 
FIGS. 2A-2B, the regions that have been manipulated in S. 
50 shibatae also exist in other species. Whatever mutations work 
in one species can be made to work in others. These corre- 
sponding regions of the sequence alignments can therefore 
serve as a guide in choice of manipulations to produce vari- 
ants in other species. Thus, the many different varieties of 
55 binding sites that can be placed at different locations on a 
chaperonin can be exploited in the formation of the nanotem- 
plates, nanostructures, nanoarrays and nanodevices of the 
present invention. 

60 Nucleic Acid Molecules 

The present invention provides various isolated nucleic 
acid molecules, or fragments or derivatives thereof, compris- 
ing polynucleotide sequences encoding the modified chap- 
eronin subunit polypeptides of the present invention. The 
65 nucleic acid molecules of the present invention are also 
termed “nucleic acid molecules encoding the modified 
polypeptides” or “nucleic acid molecules of the present 
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invention”. For example, the polynucleotide sequence of a 
nucleic acid molecule encoding the modified chaperonin sub- 
unit polypeptide is shown in FIGS. 21A, 22A, 23A, 24A, 
25A, 2 6 A, 21 A and 29A. 

The present invention further provides related polynucle- 
otide molecules, such as complementary modified chapero- 
nin subunit polypeptides, or a part thereof, and those that 
hybridize to the nucleic acid molecules of the invention. 

The polynucleotide sequences encoding the modified 
chaperonin subunit polypeptides, are preferably in isolated 
form, and include, but are not limited to, DNA, RNA, DNA/ 
RNA hybrids, and related molecules, and fragments thereof. 
Specifically contemplated are genomic DNA, cDNA, 
ribozymes, and antisense RNA or DNA molecules, as well as 
nucleic acids molecules based on an alternative backbone or 
including alternative bases, whether derived from natural 
sources or synthesized. 

In accordance with the practice of the invention, the nucleic 
acid molecules of the invention can be isolated full-length or 
partial length molecules. The nucleotide sequence of the 
invention can encode all or portions of the modified chapero- 
nin subunit polypeptides of the invention. 

Isolated Nucleic Acid Molecules 

The nucleic acid molecules of the invention are preferably 
in isolated form, where the nucleic acid molecules are sub- 
stantially separated from contaminant nucleic acid molecules 
having sequences other than modified chaperonin subunit 
sequences. A skilled artisan can readily employ nucleic acid 
isolation procedures to obtain isolated, modified chaperonin 
subunit sequences (Sambrook et al., in: “Molecular Cloning” 
(1989)). The present invention also provides for isolated 
modified chaperonin subunit sequences generated by recom- 
binant DNA technology or chemical synthesis methods. The 
present invention also provides nucleotide sequences isolated 
from various organisms including bacterial, viral, mamma- 
lian (e.g., bovine, porcine, murine, equine, canine, feline, 
monkey, ape, ovine or human), piscine, avian or insect. 

The isolated nucleic acid molecules include DNA, RNA, 
DNA/RNA hybrids, and complementary sequences, or a frag- 
ment or derivative thereof, and those which hybridize to the 
nucleic acid molecules encoding the modified chaperonin 
subunit polypeptides. The preferred nucleic acid molecules 
have nucleotide sequences identical to or similar to the nucle- 
otide sequences disclosed herein. Specifically contemplated 
are genomic DNA, RNA e.g., small interfering RNA, cDNA, 
ribozymes and antisense molecules. 

Linker Sequences 

The present invention provides nucleic acid molecules 
encoding the modified polypeptides having the naturally- 
occurring N- and C-terminal ends linked directly together or 
linked with an intervening linker sequence. The nucleic acid 
molecule can encode the intervening linker sequence com- 
prising 1 to about 10 amino acid residues. The nucleic acid 
molecules encode a flexible or helical linker peptide. The 
intervening linker sequence comprises amino acid residues 
such as glycine, serine, alanine, and/or threonine, or deriva- 
tives thereof. In one embodiment, the nucleic acid molecule 
encodes a linker sequence comprising the amino acid 
sequence Gly-Gly-Ser-Gly-Gly-Thr (SEQ ID NO: 64). In 
another embodiment, the nucleic acid molecule comprises 
the nucleotide sequence GGTGGTTCTGGTGGTACC (SEQ 
ID NO: 65) shown as part of the sequences in FIGS. 21A, 22A, 
23A, 24A, 25A 26A, 27A and 29A. 
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The Second Polypeptide Sequences 

The present invention provides nucleic acid molecules 
encoding the modified polypeptides joined with a second 
polypeptide sequence. The modified chaperonin subunit 
5 sequence is operatively joined, in-frame, to the second 
polypeptide sequence. 

The nucleotide sequence encoding the second polypeptide 
can encode a: protein; peptide fragment; reporter molecule; 
metal-binding peptide; enzyme; hormone; growth factor; 
to trophic factor; antibody; receptor; toxin; fluorescent protein; 
luminescent protein; or metal-binding peptide. These second 
polypeptides are described above. 

Sequence Identity and Similarity 
15 The present invention provides isolated nucleic acid mol- 
ecules having a polynucleotide sequence identical or similar 
to the modified chaperonin subunit sequences disclosed 
herein. 

One embodiment of the invention provides nucleic acid 
20 molecules that exhibit sequence identity or similarity with the 
modified chaperonin subunit nucleotide sequences, such as 
molecules that have at least 60 % to 99 . 9 % sequence similarity 
and up to 1 00% sequence identity with the sequences of the 
invention as shown in FIG. 21A, 22A, 23A, 24A, 25A, 26A, 
25 27A, or 29A. Another embodiment provides nucleic acid 
molecules that exhibit between about 75 % to 99 . 9 % sequence 
similarity, and another embodiment provides molecules that 
have between about 86 % to 99 . 9 % sequence similarity. Yet 
another embodiment provides molecules that have 100% 
30 sequence identity with the modified chaperonin subunit 
sequences of the invention as shown in FIG. 21 A, 22 A, 23 A, 
24A, 25A, 2 6 A, 27A, or 29A. 

Complementary Nucleotide Sequences 

The present invention also provides nucleic acid molecules 
35 that are complementary to the sequences as described in FIG. 
21A, 22A, 23 A, 24A, 25A, 26A, 27A, or 29A. Complemen- 
tarity can be full or partial . A nucleotide sequence that is fully 
complementary is complementary to the entire modified 
chaperonin subunit sequence as described in any one of FIG. 
40 21 A, 22A, 23 A, 24A, 25A, 26A, 27A, or 29 A. A nucleotide 
sequence that is partially complementary is complementary 
to only a portion of sequences as described in any one of FIG. 
21 A, 22A, 23A, 24A, 25A, 26A, 27 A, or 29A. The comple- 
mentary molecules include anti-sense nucleic acid mol- 
45 ecules. 

Hybridizing Nucleic Acid Molecules 

The present invention further provides nucleic acid mol- 
ecules having polynucleotide sequences that selectively 
50 hybridize to the nucleotide sequence of the invention as 
shown in any one of FIG. 21A, 22A, 23 A, 24A, 25A, 26A, 
27A, or 29A. The nucleic acid molecules that hybridize can 
hybridize under high stringency hybridization conditions. 
Typically, hybridization under standard high stringency con- 
55 ditions will occur between two complementary nucleic acid 
molecules that differ in sequence complementarity by about 
70% to about 1 00%. It is readily apparent to one skilled in the 
art that the high stringency hybridization between nucleic 
acid molecules depends upon, for example, the degree of 
60 identity, the stringency of hybridization, and the length of 
hybridizing strands. The methods and formulas for conduct- 
ing high stringency hybridizations are well known in the art 
(Sambrook, et al., in: “Molecular Cloning” (1989)). 

In general, stringent hybridization conditions are those 
65 that: (1) employ low ionic strength and high temperature for 
washing, for example, 0.01 5M NaCl/0.0015M sodium cit- 
rate/0. 1% SDS at 50 degrees C.; or (2) employ during hybrid- 
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ization a denaturing agent such as formamide, for example, 
50% (vol/vol) formamide with 0.1% bovine serum albumin/ 
0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium 
phosphate buffer at pH 6.5 with 750 mM NaCl, 75 mM 
sodium citrate at 42 degrees C. 

Another example of stringent conditions include the use of 
50% formamide, 5xSSC (0.75M NaCl, 0.075 M sodium cit- 
rate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyro- 
phosphate, 5xDenhardt’s solution, sonicated salmon sperm 
DNA (50 mg/ml), 0.1% SDS, and 10% dextran sulfate at 42 
degrees C., with washes at 42 degrees C. in 0.2xSSC and 
0.1% SDS. A skilled artisan can readily determine and vary 
the stringency conditions appropriately to obtain a clear and 
detectable hybridization signal. 

Nucleic Acid Fragments 

The present invention further provides nucleic acid mol- 
ecules having fragments of the modified chaperonin subunit 
sequences of the invention, such as a portion of the sequences 
shown in any one of FIGS. 21 A, 22A, 23 A, 24A, 25A, 26A, 
27A, or 29A. The size of the fragment will be determined by 
its intended use. For example, if the fragment is chosen to 
encode a full-length modified chaperonin subunit polypep- 
tide, then the skilled artisan shall select the polynucleotide 
fragment that is large enough to encode this polypeptide. If 
the fragment is to be used as a nucleic acid probe or PCR 
primer, then the fragment length is chosen to obtain a rela- 
tively small number of false positives during a probing or 
priming procedure. 

The nucleic acid molecules, fragments thereof, and probes 
and primers of the present invention are useful for a variety of 
molecular biology techniques including, for example, hybrid- 
ization screens of libraries, or detection and quantification of 
mRNA transcripts as a means for analysis of gene transcrip- 
tion and/or expression. The probes and primers can be DNA, 
RNA or derivatives of DNA or RNA molecules. A probe or 
primer length of at least 15 base pairs is suggested by theo- 
retical and practical considerations (Wallace, B. and Miyada, 
G. 1987 in: “Oligonucleotide Probes for the Screening of 
Recombinant DNA Libraries” in: Methods in Enzymology, 
152:432-442, Academic Press). 

Fragments of the modified chaperonin nucleotide 
sequences that are particularly useful as selective hybridiza- 
tion probes or PCR primers can be readily identified from the 
modified chaperonin nucleotide sequences, using art-known 
methods. For example, sets of PCR primers that bind and/or 
detect a portion of nucleic acid molecule encoding the modi- 
fied chaperonin subunit polypeptides can be made by PCR 
methods (described in U.S. Pat. No. 4,965,188). The probes 
and primers of this invention can be prepared by methods well 
known to those skilled in the art (Sambrook, et al. supra). The 
probes and primers can be synthesized by chemical synthesis 
methods (ed: Gait, M. J. 1984 in: “Oligonucleotide Synthe- 
sis”, IRL Press, Oxford, England). 

One embodiment of the present invention provides nucleic 
acid primers that are complementary to any of the sequences 
shown in FIGS. 21A, 22A, 23A, 24A, 25A, 26A, 27A, or 
29A, which allow specific amplification of nucleic acid mol- 
ecules of the invention or of any specific portions thereof. 

Alternatively, a fragment of the modified chaperonin sub- 
unit molecule sequence can be used to construct a recombi- 
nant fusion gene having the modified chaperonin subunit 
sequence fused to a different sequence. 

Codon Usage Variants 

The present invention provides isolated codon-usage vari- 
ants that differ from the disclosed modified chaperonin sub- 
unit nucleotide sequences, yet do not alter the predicted 
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polypeptide sequence or biological activity of the encoded 
modified chaperonin subunit polypeptide. For example, a 
number of amino acids are designated by more than one 
triplet codon. Codons that specify the same amino acid can 
5 occur due to degeneracy in the genetic code. Examples 
include nucleotide codons CGT, CGG, CGC, and CGA 
encoding the amino acid, arginine (R); or codons GAT, and 
GAC encoding the amino acid, aspartic acid (D). Thus, a 
protein can be encoded by one or more nucleic acid molecules 
10 that differ in their specific nucleotide sequence, but still 
encode protein molecules having identical sequences. The 
amino acid coding sequence is as follows: 


Amino Acid 

Symbol 

One Letter 
Symbol 

Codons 

Alanine 

Ala 

A 

GCU, GCC, GCA, GCG 

Cysteine 

Cys 

C 

UGU, UGC 

Aspartic Acid 

Asp 

D 

GAU, GAC 

Glutamic Acid 

Glu 

E 

GAA, GAG 

Phenylalanine 

Phe 

F 

uuu, uuc 

Glycine 

Gly 

G 

GGU, GGC, GGA, GGG 

Histidine 

His 

H 

CAU, CAC 

Isoleucine 

lie 

I 

AUU, AUC, AUA 

Lysine 

Lys 

K 

AAA, AAG 

Leucine 

Leu 

L 

UUA, UUG, CUU, CUC, 
CUA, CUG 

Methionine 

Met 

M 

AUG 

Asparagine 

Asn 

N 

AAU, AAC 

Proline 

Pro 

P 

CCU, CCC, CCA, CCG 

Glutamine 

Gin 

Q 

CAA, CAG 

Arginine 

Arg 

R 

CGU, CGC, CGA, CGG, 
AGA, AGG 

Serine 

Ser 

S 

UCU, UCC, UCA, UCG, 
AGU, AGC 

Threonine 

Thr 

T 

ACU, ACC, ACA, ACG 

Valine 

Val 

V 

GUU, GUC, GUA, GUG 

Tryptophan 

Tip 

w 

UGG 

Tyrosine 

Tyr 

Y 

UAU, UAC 


The codon-usage variants can be generated by recombi- 
nant DNA technology. Codons can be selected to optimize the 
40 level of production of the RNA transcripts encoding the modi - 
fied chaperonin subunit polypeptides in a particular prokary- 
otic or eukaryotic expression host. Codons can be selected to 
extend the half-life or increased stability of the transcript 
encoding the modified chaperonin subunit polypeptide. The 
45 present invention provides selecting every possible triplet 
codon to generate every possible combination of nucleotide 
sequences that encode the modified chaperonin subunit 
polypeptides of the present invention. This particular embodi- 
ment provides isolated nucleotide sequences that vary from 
50 the sequences as described in described in any one of FIGS. 
21 A, 22A, 23 A, 24A, 25A, 26A, 27A, or 29A, such that each 
variant nucleotide sequence encodes a molecule having 
sequence identity with the amino acid sequence described in 
FIGS. 21B, 22B, 23B, 24B, 25B, 26B, 27B, or 29B. 

55 

Variant Nucleotide Sequences 

The present invention provides nucleic acid molecules 
comprising polynucleotide sequences encoding variant forms 
of any of the modified chaperonin subunit polypeptides hav- 
60 mg the same or similar structure to a naturally-occurring, 
wild-type chaperonin subunit polypeptide. 

The variant nucleotide sequences of the present invention 
include conservative or non-conservative amino acid substi- 
tutions. The variant nucleotide sequences include mutations 
65 such as amino acid substitutions, deletions, insertions, addi- 
tions, truncations, or processing or cleavage errors of the 
protein. The variant nucleotide sequences include allelic or 
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homolog variants of the naturally-occurring modified chap- 
eronin subunit polypeptides. In one embodiment, the variant 
nucleic acid molecule, which encodes a modified chaperonin 
polypeptide, comprises silent mutations where the nucleotide 
sequence in certain regions differs from that of a wild type 
sequence but still encodes the same amino acids. The silent 
mutations can be useful for suppressing Shine-Dalgarno 
sequences when expressed in bacteria. 

Derivative Nucleic Acid Molecules 

The nucleic acid molecules of the invention also include 
derivative nucleic acid molecules which differ from DNA or 
RNA molecules, and anti-sense molecules. 

Derivative molecules include peptide nucleic acids 
(PNAs), and non-nucleic acid molecules including phospho- 
rothioate, phosphotriester, phosphoramidate, and meth- 
ylphosphonate molecules, that bind to single-stranded DNA 
or RNA in a base pair-dependent manner (P C Zamecnik, et 
al, 1978 Proc. Natl. Acad. Sci. 75:280284; P C Goodchild, et 
al, 1986 Proc. Natl. Acad. Sci. 83:4143-4146). Peptide 
nucleic acid molecules comprise a nucleic acid oligomer to 
which an amino acid residue, such as lysine, and an amino 
group have been added. These small molecules, also desig- 
nated anti -gene agents, stop transcript elongation by binding 
to their complementary (template) strand of nucleic acid (P E 
Nielsen, et al., 1993 Anticancer Drug Des 8:53-63). Reviews 
of methods for synthesis of DNA, RNA, and their analogues 
can be found in: Oligonucleotides and Analogues, eds. F 
Eckstein, 1991, IRL Press, New York; Oligonucleotide Syn- 
thesis, ed. M J Gait, 1984, IRL Press, Oxford, England. Addi- 
tionally, methods for antisense RNA technology are 
described in U.S. Pat. Nos. 5,194,428 and 5,110,802. A 
skilled artisan can readily obtain these classes of derivative 
nucleic acid molecules using the herein described modified 
chaperonin polynucleotide sequences, see for example 
“Innovative and Perspectives in Solid Phase Synthesis” 
(1992) Egholm, et al. pp 325-328 or U.S. Pat. No. 5,539,082. 

Labeled Nucleic Acid Molecules 

The present invention provides nucleic acid molecules of 
the invention linked or labeled with a detectable marker. 
Examples of a detectable marker include, but are not limited 
to, a radioisotope, a fluorescent compound, a bioluminescent 
compound, a chemiluminescent compound, a metal chelator 
or an enzyme. Technologies for generating labeled nucleic 
acid molecules are well known (Sambrook et al., in Molecular 
Cloning (1989)). 

Recombinant Nucleic Acid Molecules 

The present invention provides recombinant nucleic acid 
molecules (e.g., DNA or RNA) comprising nucleotide 
sequences encoding the modified chaperonin subunit 
polypeptides, or a fragment or derivative thereof, as described 
herein. As used herein, a recombinant nucleic acid molecule 
has been subjected to molecular manipulation in vitro. Meth- 
ods for generating recombinant nucleic acid molecules are 
well known in the art (Sambrook et al., Molecular Cloning 
(1989)). In one embodiment, the recombinant DNA mol- 
ecules of the present invention are operably linked to one or 
more expression control sequences and/or vector sequences. 

Vectors 

The recombinant nucleic acid molecules of the present 
invention each comprise the polynucleotide sequence, or 
fragments or derivatives thereof, encoding the modified chap- 
eronin subunit polypeptides joined to a vector to generate a 
recombinant vector molecule. 

The term vector includes, but is not limited to, plasmids, 
cosmids, BACs, YACs, PACs andphagemids. The vector can 
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be an autonomously replicating vector comprising a replicon 
that directs the replication of the rDNA within the appropriate 
host cell. Alternatively, the vector directs integration of the 
recombinant vector into the host cell. Various viral vectors 
5 can also be used, such as, for example, a number of well 
known retroviral and adenoviral vectors (Berkner 1988 Bio- 
techniques 6:616-629). 

The vectors of the invention permit expression of the modi- 
fied chaperonin subunit polypeptides, or fragments or deriva- 
io fives thereof, in prokaryotic or eukaryotic host cells. The 
vectors can be expression vectors, comprising an expression 
control element, such as a promoter sequence, which enables 
transcription of the inserted modified chaperonin subunit 
nucleotide sequence and can be used for regulating the 
15 expression (e.g., transcription and/or translation) of a linked 
modified chaperonin subunit nucleotide sequence in an 
appropriate host cell. 

The expression control elements can be of various origins, 
including naturally -occurring and synthetic. The naturally - 
20 occurring elements can be cellular or viral in origin. Expres- 
sion control elements are known in the art and include, but are 
not limited to, inducible promoters, constitutive promoters, 
secretion signals, enhancers, transcription terminators, and 
other transcriptional regulatory elements. 

25 Other expression control elements that are involved in 
translation are known in the art, and include the Shine-Dal- 
garno sequence (e.g., prokaryotic host cells), and initiation 
and termination codons. Exogenous transcriptional elements 
and initiation codons can be of various origins, both natural 
30 and synthetic. 

The promoters can be inducible which are regulated by 
environmental stress (e.g., heat, oxidizing conditions, or toxic 
compounds), environmental stimuli or the growth medium of 
the cells (IPTG), including those from the genes for heat 
35 shock proteins (e.g., hsp60), alcohol dehydrogenase 2, isocy- 
tochrome C, acid phosphatase, enzymes associated with 
nitrogen catabolism, and enzymes responsible for maltose 
and galactose utilization. 

The promoters can be constitutive including yeast beta- 
40 factor, alcohol oxidase, cytomegalovirus, and PGH. For 
reviews, see Ausubel et al (1 987 Current Protocols in Molecu- 
lar Biology, John Wiley & Sons, New York N.Y.) and Grant et 
al (1987 Methods in Enzymology 153:516-544). 

The promoter can be plac (with a lari' 7 on the vector to 
45 reduce background expression), which can be regulated by 
the addition of isopropylthiogalactoside (IPTG), another pro- 
moter pT7<D 10, which is specific to T7 RNA polymerase and 
is not recognized by E. coli RNA polymerase. T7 RNA poly- 
merase, which is resistant to rifamycin, is encoded on the 
50 defective lambda DE lysogen in the E. coli BL21 chromo- 
some. T7 polymerase in BL2 1 (DE3) is super-repressed by the 
lari' 7 gene in the plasmid and is induced and regulated by 
IPTG. 

The efficiency of transcription can be augmented by the 
55 inclusion of enhancers appropriate to the cell system in use 
(Scharf, D., et al, 1 994 Results Probl. Cell. Differ. 20: 125-62; 
Bittner, etal., 1987 Methods in Enzymol. 153:516-544). Viral 
promoters include SV40 early promoter or the promoter 
included within the LTR of a retroviral vector. Other viral 
60 promoters include the cytomegalovirus promoter (M Bo shart, 
etal, 1985 Cell 41:521-530). 

Commonly used eukaryotic control sequences for use in 
expression vectors include promoters and control sequences 
compatible with mammalian cells such as, for example, CMV 
65 promoter and avian sarcoma virus (ASV) (jtLN vector). 
Other commonly used promoters include the early and late 
promoters from Simian Virus 40 (SV40) (Fiers, et al, 1973 
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Nature 273:113), or other viral promoters such as those 
derived from polyoma, Adenovirus 2, and bovine papilloma 
virus. An inducible promoter, such as hMTII (Karin, et ah, 
1982 Nature 299:797-802) can also be used. 

Transcriptional control sequences for yeast vectors include 
promoters for the synthesis of glycolytic enzymes (Hess et 
al., 1968) J Adv Enzyme Reg. 7:149; Holland et ah, 1978 
Biochemistry 17:4900). Additional promoters known in the 
art include the CMV promoter provided in the CDM8 vector 
(Toyama and Okayama 1990 FEBS 268:217-221); the pro- 
moter for 3-phosphoglycerate kinase (Hitzeman et ah, 1 980 J 
Biol Chem 255:2073), and those for other glycolytic 
enzymes. 

Specific translation initiation signals can also be required 
for efficient translation of a modified chaperonin subunit 
polypeptide. These signals include the ATG-initiation codon 
and adjacent sequences. The ATG-initiation sequences or 
upstream sequences of a naturally-occurring modified chap- 
eronin subunit polypeptides can be inserted into the appro- 
priate expression vector. Alternatively, a synthetic ATG-ini- 
tiation codon and other sequences can be used. The ATG- 
initiation codon must be in the correct reading-frame to 
ensure translation of the insert sequence. 

The expression control elements can be placed at the 3* end 
of the coding sequences. 

These sequences can act to stabilize messenger RNA. Such 
terminators are found in the 3' untranslated region following 
the coding sequences in several yeast-derived and mamma- 
lian genes. 

The expression vector can include at least one selectable 
marker gene encoding a gene product that confers drug resis- 
tance such as resistance to kanamycin, ampicillin, chloram- 
phenicol or tetracycline. 

The expression vector can include any marker gene. These 
include, but are not limited to, the herpes simplex virus thy- 
midine kinase (M Wigler et al., 1977 Cell 11:223-32) and 
adenine phosphoribosyltransferase (I Lowy et al., 1980 Cell 
22:817-23) genes which can be employed in tk-minus or 
aprt-minus cells, respectively. Also, antimetabolite, antibiotic 
or herbicide resistance can be used as the basis for selection; 
for example, dhfr which confers resistance to methotrexate 
(M Wigler et al, 1980 Proc Natl Acad Sci 77:3567-70); npt, 
which confers resistance to the aminoglycosides neomycin 
and G-418 (F Colbere-Garapin et al, 1981 J. Mol. Biol. 
1 50: 1 - 14) and als or pat, which confer resistance to chlorsul- 
furon and phosphinotricin acetyltransferase, respectively (L 
E Murry, in: McGraw Yearbook of Science and Technology 
(1992) McGraw Hill New York N.Y, pp 191-196). Additional 
selectable genes have been described, for example, trpB, 
which allows cells to utilize indole in place of tryptophan, or 
hisD, which allows cells to utilize histinol in place of histidine 
(Hartman, and Mulligan 1 988 Proc. Natl. Acad. Sci. 85:8047- 
51). Recently, the use of visible markers has gained popular- 
ity with such markers as anthocyanins, (3 -glucuronidase and 
its substrate, GUS, and luciferase and its substrate, luciferin, 
being widely used not only to identify transformants, but also 
to quantify the amount of transient or stable protein expres- 
sion attributable to a specific vector system (C A Rhodes et al, 
1995 Methods Mol. Biol. 55:121-131). 

The vector also comprises multiple endonuclease restric- 
tion sites that enable convenient insertion of exogenous DNA 
sequences. Methods for generating a recombinant expression 
vector encoding the modified chaperonin subunit polypep- 
tides are well known in the art (T Maniatis, et al, 1989 
Molecular Cloning, A Laboratory Manual, Cold Spring Har- 
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bor Laboratory, Cold Spring Harbor; N.Y.; F Ausubel, et al. 
1 989 Current Protocols in Molecular Biology, John Wiley & 
Sons, New York N.Y.). 

The expression vectors can include expression control ele- 
5 ments for expression in bacterial host cells . These expression 
control elements can be induced by environmental conditions 
such as heat-shock, or by addition of agents such as isopro- 
pyl-p-D-thiogalactopyranoside (e.g, IPTG) (N Yamaguchi, 
et al. 2002 The J of Biol Chem 277:6806-6812). Prokaryotic 
to cell expression vectors are well known in the art and are 
available from several commercial sources. For example, 
pET19b (Novagen, Madison, Wis.), Superlinker vectors 
pSE280 and pSE380 (Invitrogen, San Diego, Calif.), pGEX 
vector (Promega, Madison, Wis.), pTrcHisB vector (Invitro- 
15 gen), pET vector (e.g, pET-21, Novagen Corp.), BLUE- 
SCRIPT phagemid (Stratagene, LaJolla, Calif.), pSPORT 
(Gibco BRL, Rockville, Md.), or ptrp-lac hybrids can be used 
to express the modified chaperonin subunit polypeptides in 
bacterial host cells. 

20 The expression vectors used for generating modified chap- 

eronin subunit polypeptides can be compatible with eukary- 
otic host cells. The vectors can be compatible with vertebrate 
cells. These vectors can include expression control elements 
such as promoters and/or enhancers from mammalian genes 
25 or mammalian viruses. Other expression vectors can include 
tissue- specific or cell-specific promoters and/or enhancers 
from mammalian genes or mammalian viruses. 

The expression vectors can be compatible with other 
eukaryotic host cells, including insect, plant, or yeast cells. 
30 The expression vectors can include expression control ele- 
ments, such as the baculovirus polyhedrin promoter for 
expression in insect cells. The promoters and/or enhancers 
derived from plant cells (e.g, heat shock, RUBISCO, storage 
protein genes), viral promoters or leader sequences or from 
35 plant viruses can also be used. 

Eukaryotic cell expression vectors are well known in the 
art and are available from several commercial sources, 
including PSVL and pKS V- 1 0 (Pharmacia), pBPV- 1 /pML2d 
(International Biotechnologies, Inc.), pTDTl (ATCC, 
40 #31255), and similar eukaryotic expression vectors. 
Examples of expression vectors for eukaryotic host cells 
include, but are not limited to, vectors for mammalian host 
cells including: BPV-1; pHyg; pRSV; pSV2; pTK2 (Mania- 
tis); pIRES (Clontech); pRc/CMV2; pRc/RSV; pSFVl (Life 
45 Technologies); pVPakc Vectors; pCMV vectors; pSG5 vec- 
tors (Stratagene); retroviral vectors (e.g, pFB vectors (Strat- 
agene)); pCDNA-3 (Invitrogen) or modified forms thereof; 
adenoviral vectors; Adeno -associated virus vectors; bacu- 
lovirus vectors. Other expression vectors for eukaryotic host 
50 cells include pESC vectors (Stratagene) for yeast and pFast- 
Bac for expression in insect cells (Gibco/BRL, Rockville, 
Md.). 

Host-Vector Systems 

The present invention further provides a host-vector sys- 
tem comprising a vector comprising a modified chaperonin 
subunit nucleotide sequence, or a fragment or derivative 
thereof, introduced into a suitable host cell. 

The host-vector system can be used to transcribe and/or 
express (e.g, produce) the modified chaperonin subunit 
polypeptides of the invention. The host cell can be either 
prokaryotic or eukaryotic. 

Prokaryotic Host Cells 

Examples of suitable prokaryotic host cells include bacte- 
65 ria strains from genera such as Escherichia, Bacillus, 
Pseudomonas, Streptococcus , and Streptomyces . In one 
embodiment, E. coli strain BL21(DE3) and BL21(DE3/ 
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pLysS) (Novagen, Madison, Wis.), although other compat- 
ible recA strains, such as HMS174(DE3) and HMS174(DE3/ 
pLysS) can be used. 

In bacterial systems, a number of expression vectors can be 
selected depending upon the use intended for the modified 
chaperonin subunit polypeptides. For example, when large 
quantities of the modified chaperonin subunit polypeptides 
are desired, vectors that direct high level expression of fusion 
proteins that are soluble and readily purified can be desirable. 
Such vectors include, but are not limited to, the multifunc- 
tional E. coli cloning and expression vectors such as BLUE- 
SCRIPT (Stratagene), in which the modified chaperonin sub- 
unit nucleotide sequence can be ligated into the vector 
in-frame with sequences for the amino -terminal Met and the 
subsequent 7 residues of galactosidase so that a hybrid protein 
is produced. Other vectors include the pET19b (Novagen), 
pIN vectors (Van Heeke & Schuster 1989 J Biol Chem 264: 
5503-5509), and the like. The pGEX vectors (Promega, 
Madison Wis.) can also be used to express foreign proteins as 
fusion proteins with glutathione S-transferase (GST). In gen- 
eral, such fusion proteins are soluble and can easily be puri- 
fied from lysed cells by adsorption to glutathione-agarose 
beads followed by elution in the presence of free glutathione. 
Proteins made in such systems are designed to include hep- 
arin, thrombin or factor Xa protease cleavage sites so that the 
cloned protein of interest can be released from the GST moi- 
ety at will. 

The methods for introducing the modified chaperonin sub- 
unit nucleotide sequences into the host cells are well-known 
methods that depend on the type of vector used and host 
system employed. For example, prokaryotic host cells are 
typically introduced (e.g., transformed) with nucleic acid 
molecules by electroporation or salt treatment methods (Co- 
hen et al., 1972 Proc Acad Sci USA 69:21 10; Maniatis, T., et 
al., 1 989 in: Molecular Cloning, A Laboratory Manual, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, N.Y. 

Eukaryotic Host Cells 

Examples of suitable eukaryotic host cells include insect 
cells, yeast cells, plant cells, or animal cells such as mamma- 
lian cells. 

An expression system that can be used to express modified 
chaperonin subunit polypeptides is an insect system. In one 
such system, Autographa californica nuclear polyhedrosis 
virus (AcNPV) can be used as a vector to express foreign 
genes in Spodoptera frugiperda insect cells or in Trichoplusia 
larvae. The sequence encoding a modified chaperonin sub- 
unit polypeptides can be cloned into a nonessential region of 
the virus, such as the polyhedrin gene, and placed under 
control of the polyhedrin promoter. Successful insertion of a 
modified chaperonin subunit sequence will render the poly- 
hedrin gene inactive and produce recombinant virus lacking 
coat protein. The recombinant viruses can then used to infect 
S. frugiperda cells or Trichoplusia larvae in which the modi- 
fied chaperonin subunit polypeptides can be expressed 
(Smith et al 1983 J Virol 46:584; E K Engelhard, et al, 1994 
Proc Nat Acad Sci 91:3224-3227). 

In mammalian host cells, a number of viral-based expres- 
sion systems can be utilized. In cases where an adenovirus is 
used as an expression vector, a modified chaperonin subunit 
nucleotide sequence can be ligated into an adenovirus tran- 
scription/translation vector having the late promoter and tri- 
partite leader sequence. Insertion in a nonessential El or E3 
region of the viral genome results in a viable virus capable of 
expressing a modified chaperonin subunit polypeptides in 
infected host cells (Logan and Shenk 1 984 Proc Natl Acad Sci 
81 :3 65 5 -5 9). In addition, transcription enhancers, such as the 
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rous sarcoma virus (RSV) enhancer, can be used to increase 
expression in mammalian host cells. 

In yeast, Saccharomyces cerevisiae, a number of vectors 
including constitutive or inducible promoters such as beta- 
5 factor, alcohol oxidase and PGH can be used. For reviews, see 
Ausubel et al (Current Protocols in Molecular Biology, John 
Wiley & Sons, New York N.Y.) and Grant et al (1 987 Methods 
inEnzymology 153:516-544). 

In cases where plant expression vectors are used, the 
to expression of a sequence encoding a modified chaperonin 
subunit polypeptides can be driven by any of a number of 
promoters. For example, viral promoters such as the 35S and 
19S promoters of CaMV (Brisson, et al., 1984 Nature 310: 
511-514) can be used alone or in combination with the omega 
15 leader sequence from TMV (Takamatsu, et al ., 1 987 EMBO J 
6:307-31 1). Alternatively, plant promoters such as the small 
subunit of RUBISCO (Coruzzi et al 1984 EMBO J 3:1671- 
1680; Broglie etal 1984 Science 224:838-843); or heat shock 
promoters (J Winter and R M Sinibaldi 1991 Results Probl 
20 Cell Differ. 17:85-105) can be used. 

In addition, a host cell strain can be chosen for its ability to 
modulate the expression of the inserted modified chaperonin 
subunit nucleotide sequences or to process the expressed 
protein in the desired fashion. Such modifications of the 
25 expressed modified chaperonin subunit polypeptides include, 
but are not limited to, acetylation, carboxylation, glycosyla- 
tion, phosphorylation, lipidation and acylation. Post-transla- 
tional processing which cleaves a precursor form of the pro- 
tein (e.g., a pre-pro protein) can also be important for correct 
30 insertion, folding and/or function. Different host cells such as 
CHO, HeLa, MDCK, 293, WI38, etc. have specific cellular 
machinery and characteristic mechanisms for such post- 
translational activities and can be chosen to ensure the correct 
modification and processing of the introduced, foreign pro- 
35 tein. 

The methods for introducing the modified chaperonin sub- 
unit nucleotide sequences into eukaryote host cells are well- 
known methods that depend on the type of vector used and 
host system employed. For example, in vertebrate cells, the 
40 nucleic acid sequences are introduced with vectors using 
various methods, including calcium phosphate-mediated 
DNA transfection (Graham and Van der Eb 1973 Virology 
52:456-467; M Wigler, et al 1977 Cell 1 1 :223-232) or other 
cationic-mediated transfection methods, electroporation (E 
45 Neuman, et al 1982 EMBO J 1 :84 1-845), microinjection (W 
F Anderson, et al 1980 Proc Natl Acad Sci USA 77:5399- 
5403; M R Cappechi 1980 Cell 22:479-488; A Graessman, et 
al 1979 J Virology 32:989-994), or lipid methods including 
encapsulation of DNA in lipid vesicles (M Schaefer-Ridder 
50 1 982 Science 215:1 66-168). Other methods include the par- 
ticle gun method. Still other methods include using an aden- 
ovirus transcription/translation vector comprising the late 
promoter and tripartite leader sequence. A nucleic acid 
sequence can be inserted in a nonessential El or E3 region of 
55 the adenoviral genome to create a viable virus capable of 
expressing the protein encoded by the nucleic acid sequence 
(Logan and Shenk 1984 Proc Natl Acad Sci 81:3655-59). 
Alternatively, retroviral transfer methods can be used (E 
Gibloa, et al 1986 BioTechniques 4:504-512). 

60 Plant cells can be introduced by direct DNA transformation 

or pathogen-mediated transfection. For reviews of such tech- 
niques, see Hobbs, S. in: “McGraw Yearbook of Science and 
Technology” (1992) McGraw Hill New York N.Y., pp 191 - 
196; or Weissbach and Weissbach (1988) in: “Methods for 
65 Plant Molecular Biology”, Academic Press, New York N.Y., 
pp 421-463. Alternatively, plant cells can be introduced via a 
particle-gun method using metal particles. 
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Selection of Transformed Cells 

The host cells introduced with the modified chaperonin 
subunit nucleotide sequences can be identified by techniques 
well known in the art. The cells can be selected, lysed and 
their DNA content examined for the presence of the intro- 
duced sequences using a DNA gel blot method or similar 
method (Southern 1 975 J Mol Biol 98:503; Berent et ah, 1 985 
Biotech 3:208). Alternatively, the proteins produced from the 
cells of the invention can be assayed via a biochemical assay 
or immunological method. 

Any number of selection systems can be used to recover the 
introduced (e.g, transformed or transfected) cells. The intro- 
duced cells can be selected based on expression of herpes 
simplex virus thymidine kinase (Wigler, M., et ah, 1977 Cell 
11:223-32), or adenine phosphoribosyltransferase (Lowy, I. 
et al., 1980 Cell 22:817-23) genes which can be employed in 
tk-minus or aprt -minus cells, respectively. Also, antimetabo- 
lite, antibiotic or herbicide resistance can be used as a basis 
for selection; for example, dhfr which confers resistance to 
methotrexate (Wigler, M., et al., 1980 Proc Natl Acad Sci 
77:3567-70); npt, which confers resistance to the aminogly- 
cosides neomycin and G-418 (Colbere-Garapin, F., et al., 
1981 J. Mol. Biol. 150:1-14) and als or pat, which confer 
resistance to chlorsulfiiron and phosphinotricin acetyltrans- 
ferase, respectively. Additional selectable genes have been 
described, for example, trpB, which allows cells to utilize 
indole in place of tryptophan, or liisD, which allows cells to 
utilize histinol in place of histidine (S C Hartman and R C 
Mulligan 1988 Proc. Natl. Acad. Sci. 85:8047-51). Recently, 
the use of visible markers has gained popularity with such 
markers as anthocyanins, beta -glucuronidase and its sub- 
strate, GUS, and luciferase and its substrate, luciferin, being 
widely used not only to identify transformants, but also to 
quantify the amount of transient or stable protein expression 
attributable to a specific vector system (C Rhodes, et al., 1 995 
Methods Mol. Biol. 55:121-131). 

Producing Modified Polypeptides 

The present invention provides methods for producing the 
modified chaperonin subunit polypeptides. The modified 
polypeptides of the invention, or fragments or derivatives 
thereof, can be produced as recombinant molecules produced 
in prokaryote or eukaryote host cells, or generated as syn- 
thetic molecules. 

The modified polypeptides can be produced using standard 
recombinant technology techniques. For example, expression 
vectors comprising nucleotide sequences encoding the modi- 
fied chaperonin subunit polypeptides can be introduced into a 
prokaryote or eukaryote host cell (i.e., a host- vector system), 
the host- vector system can be cultured under conditions suit- 
able for the host cell to produce the modified polypeptides, 
and the polypeptides so produced can be isolated. The iso- 
lated modified polypeptides can be enriched to further sepa- 
rate the modified polypeptides away from other non-desirable 
polypeptides or cellular fractions. The isolated modified 
polypeptides can be purified to produce a sample of modified 
polypeptides having very little or no contaminating non-de- 
sirable polypeptide or cellular fractions. 

In one embodiment, the host- vector system can be cultured 
at non-heat stress temperatures (e.g., 75 degrees C.). In 
another embodiment, the host-vector system can be cultured 
at heat stress temperatures (e.g., 86 degrees C.). 

In a non-limiting example, a sequence encoding a thermo- 
stable chaperonin, e.g. a thermostable HSP60, can be trans- 
ferred into E. coli and grown at temperatures standard for the 
cell. The expressed polypeptide can then be easily purified 
from E. coli proteins by heating and centrifugation. The ther- 
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mo labile E. coli proteins precipitate leaving the thermostable 
polypeptide greater than 90% pure after a centrifugation. 

In an embodiment where the chaperonin polypeptides are 
thermostable extremophiles, the cell extracts can be heated 
5 for easier purification of the chaperonin subunits. For 
example, the purification of the chaperonin beta subunit of 
Sulfolobus shibatae expressed in E. coli involves heating total 
cell extracts to 85° C. for 30 minutes, which precipitates most 
E. coli proteins, but the thermostable beta remains soluble. 
10 Therefore, heating and centrifuging cell extracts separates the 
beta subunit from most E. coli proteins, which simplifies 
further purification using ion exchange chromatography (Ka- 
gawa, H. K. et al., 1995 The 60 kDa heat shock proteins in the 
hyperthermophilic archaeon Sulfolobus shibatae. J Mol Biol 
15 253:712-25). 

In one non-limiting example, a purification procedure 
comprises, either alone or in combination: 1) chromatogra- 
phy on molecular sieve, ion-exchange, and/or hydrophobic 
matrices; 2) preparative ultracentrifiigation; and 3) affinity 
20 chromatography. 

Assembled Chaperonin Structures 

The present invention provides assembled chaperonin 
structures comprising at least one of the modified chaperonin 
25 polypeptides of the invention from Group I or Group II sub- 
unit polypeptides, or any combination and any proportion of 
Group I and II. The chaperonin structures comprise at least 
one of the modified chaperonin polypeptides from alpha, 
beta, or gamma subunits, or any combination or in any pro- 
30 portion thereof. The chaperonin structures comprise at least 
one modified chaperonin polypeptide from one organism, or 
from different organisms (chimeric chaperonin structure). 
The chaperonin structures comprise at least one modified 
polypeptide which are based on HSP60, TCP-1, thermal fac- 
35 tor 55 (TF55), thermal factor 56 (TF56), or GroEL subunits. 
The choice of subunits can be made depending on factors 
such as operating conditions. For example, if the subunits or 
chaperonin structures are to experience high operating tem- 
peratures, then one skilled in the art could select a modified 
40 polypeptide from extremophiles. The chaperonin structures 
comprise 7, 8, 9 or 10 subunits per ring. The chaperonin 
structures can have 2-, 3-, 4-, 5-, 6-, 7-, 8- 9-, or 10-fold 
symmetry. The chaperonin structures comprise at least on of 
the modified chaperonin polypeptides and a co-chaperonin 
45 (e.g., GroEL and GroES). 

Production In Vivo 

The present invention provides methods for producing the 
double-ringed chaperonin structures in vivo, comprising: cul- 
turing a host cell carrying an expression vector (host -vector 
50 system) under conditions suitable for the cell to produce the 
modified chaperonin polypeptide in the cell and permitting 
self-assembly of the chaperonin structure in the cell; and 
isolating the assembled chaperonin structure from the cell. 

In one embodiment, the host cell co-expresses the modified 
55 chaperonin polypeptide (e.g., GroEL) and a co-chaperonin 
polypeptide (e.g., GroES) so as to self-assemble in the cell the 
chaperonin structure comprising the modified chaperonin 
polypeptide and the co-chaperonin. 

60 Production In Vitro 

The present invention provides methods for producing the 
double-ringed chaperonin structures in vitro. In one embodi- 
ment, the methods comprise: reacting isolated modified chap- 
eronin polypeptides under conditions suitable to permit self- 
65 assembly of the modified chaperonin polypeptides to form 
the chaperonin structures. In another embodiment, the meth- 
ods comprise: reacting isolated modified chaperonin 
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polypeptides with wild-type chaperonin polypeptides under 
conditions suitable to permit self-assembly of the modified 
chaperonin polypeptides and the wild-type chaperonin 
polypeptides to form the chaperonin structures. 

The assembly reaction can be performed using modified 
and/ or wild-type chaperonin polypeptides that are enriched or 
purified. Alternatively, the assembly reaction can be per- 
formed using modified and/or wild-type chaperonin polypep- 
tides from crude cell extracts. The assembly reaction can be 
allowed to proceed prior to purification of the assembled 
chaperonin structures. 

The conditions suitable to permit self-assembly of the 
modified chaperonin polypeptides (with modified chaperonin 
polypeptides or with wild-type chaperonin polypeptides) 
includes Mg 2+ , in the presence of ATP, ADP, AMP-PNP, GTP 
or ATPyS. The temperature or pH for formation will depend 
on the type and thermostability of the modified or wild-type 
chaperonin polypeptides or the chaperonin structures. For 
example, for the thermostable chaperonin beta subunit of S. 
shibatae , the temperature can be 4° C.-85° C., while it may be 
lower for other types of polypeptides (e.g., less than 40° C.). 
For a given modified chaperonin polypeptide, optimal condi- 
tions for self-assembly (i.e., concentration and proportion of 
Mg 2+ to ATP, ADP, AMP-PNP, GTP or ATPyS) are easily 
determined by routine experimentation, such as by changing 
each variable individually and monitoring formation of the 
appropriate products. For formation of the chaperonins any of 
Mg 2+ , ATP, ADP, AMP-PNP, GTP or ATPyS can be present in 
an amount ranging from 1 mM, up to 1 0 mM, 20 mM, 30 mM 
or higher. For example, filaments can be formed in 5 mM 
HEPES buffer with 25 mM MgCl 2 and 1 mM ATP (total 
volume 300 pi) (Yoai et al., 1998 Archives of Biochemistry 
and Biophysics 356: 55-62). While it has been shown that the 
formation of chaperonins from alpha and beta subunits of S. 
shibatae does not depend on the presence of K + , formation of 
the higher order structures from the subunits of other organ- 
isms may require the presence of K + . 

In another embodiment, the chaperonins are formed in the 
absence of introduction of any of Mg 2+ , ATP, ADP, AMP- 
PNP, GTP or ATPyS. At sufficiently high concentrations of 
the chaperonins, e.g., at concentrations of 2-5 mg/ml, or up to 
30 mg/ml or more, some of the higher order structures can 
spontaneously assemble (Quaite-Randall et al., 1995, J. Biol. 
Chem. 270, 28818-28823). The concentration of the chaper- 
onins or chaperonin polypeptides in different embodiments is 
0.1 mg/ml, 0.5 mg/ml, 1 mg/ml, 2, mg/ml, 5 mg/ml, 10 
mg/ml, 20 mg/ml, 30 mg/ml, 50 mg/ml or higher. 

Higher Order Structures 

The present invention provides higher order structures 
comprising the modified chaperonin polypeptides of the 
invention. The higher order structures include rings, tubes, 
filaments, and sheets (2-D crystals). The nanorings can have 
rectangular, pentagonal, hexagonal or heptagonal architec- 
tural arrangements of chaperonins. 

The present invention also provides methods for producing 
the higher order structures. In one embodiment, the methods 
comprise: reacting isolated modified chaperonin polypep- 
tides under conditions suitable to permit self-assembly of the 
modified chaperonin polypeptides to form assembled chap- 
eronin structures which can form the higher order structures. 
In another embodiment, the methods comprise: reacting iso- 
lated modified chaperonin polypeptides with wild-type chap- 
eronin polypeptides under conditions suitable to permit self- 
assembly of the modified chaperonin polypeptides and the 
wild-type chaperonin polypeptides to form assembled chap- 
eronin structures which can form the higher order structures. 
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The conditions suitable to permit formation of the higher 
order structures include: the presence or absence of ATP, 
ADP, AMP-PNP, GTP or ATPyS in the reaction; and/or the 
concentration of the modified and/or wild-type chaperonin 
5 polypeptides. The presence of ATP induces the formation of 
an extensive network of filaments, while using ADP, AMP- 
PNP, GTP or ATPyS induces the formation of shorter fila- 
ments. For example, with respect to reactions comprising 
mutant and/or mutant and wild-type TF55 a and (3 subunits of 
10 S. shibatae , chaperonin structures can be formed at concen- 
trations of approximately 0.1 mg/ml, while filaments are 
formed at approximately 0.5 mg/ml. Longer aligned fila- 
ments can be formed at concentrations of approximately 1 .0 
mg/ml. Other conditions are known for forming structures 
15 like filaments of differing average lengths or two-dimen- 
sional arrays (Yaoi et al., 1998 Archives of Biochemistry and 
Biophysics 356: 55-62; and Trent et al., 1997 Proc. Natl. 
Acad. Sci 94: 5383-5388). Thus, the length of filaments can 
be controlled through manipulation and choice of reaction 
20 conditions, with certain concentrations necessary for particu- 
lar structures being routinely attainable. 

The architectural symmetry of the chaperonin structures 
and higher order structures can be dictated by varying the 
reaction conditions, or through directed binding or arrange- 
25 ment of the chaperonins relative to each other. The architec- 
tural symmetry includes — 1, 2-, 3-, 4-, 5-, 6-, 7-, 8-, 9- or 
10-fold symmetry. For example, nanofilaments having one- 
dimensional architectural symmetry can be formed using cer- 
tain mutant chaperonin polypeptides in the presence of Mg 2+ 
30 and nucleotides. These nanofilaments can cluster to form 
bundles of filaments that are microns in length and with 
bundle diameters of up to microns in thickness. 

FIG. 5 shows that in the electron microscope individual 
HSP60s in the double-rings appear as black “blobs” (A, end 
35 view) or alternating dark and light bands (B, side view). These 
double-rings self-assemble into chains or porous tubes (C) 
and the tubes associate into filaments (D). FIG. 6 shows the 
organization of HSP60 rings into 2 -dimensional crystals on a 
metal grid coated with lipid (A) and filament bundles 
40 arranged on a bed of rings (visible as spots in background) 
(B). In general, the choice of proportion of ATP to Mg 2+ 
affects the formation of higher order structures, such as fila- 
ments or arrays. The higher order structures can have long 
range two- or three-dimensional ordering as in an array with 
45 trigonal or hexagonal close packed architectural arrangement 
of the chaperonins through self-assembly (FIG. 6). 

The various architectural symmetries can also be dictated 
through directed arrangement of the chaperonins onto a sub- 
strate either through a masking technique or by directed bind- 
50 ing (Whaley et al., 2000 Nature 405: 665-668, which 
describes peptides that bind to selectively to specific faces 
gallium arsenide, silicon or indium phosphide). An exem- 
plary, non-limiting list of partial amino-acid sequences from 
clones that bind to different surfaces of GaAs and/or InP 
55 (Whaley et al., 2000 Nature 405: 665-668) includes: 


VTSPDSTTGAMA 

(SEQ 

ID 

NO: 

16) 

AASPTQSMSQAP 

(SEQ 

ID 

NO: 

17) 

AQNPSDNNTHTH 

(SEQ 

ID 

NO: 

18) 

ASSSRSHFGQTD 

(SEQ 

ID 

NO: 

19) 

WAHAPQLASSST 

(SEQ 

ID 

NO: 

20) 

ARYDLSIPSSES 

(SEQ 

ID 

NO: 

21) 
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-continued 


TPPRPIQYNHTS 

SSLQLPENSFPH 

GTLANQQIFLSS 

HGNPLPMTPFPG 

RLELAIPLQGSG 


(SEQ ID NO: 22) 
(SEQ ID NO: 23) 
(SEQ ID NO: 24) 
(SEQ ID NO: 25) 
(SEQ ID NO: 26) 
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Whaley et al. also describes amino-acid sequences that 
bind silicon and not silicon dioxide. An example of an amino- 
acid sequence that binds to ZnS(l 02) (Lee et al., 2002 Science 
296: 892-895) is: 


CNNPMHQNC (SEQ ID NO: 27) 

A list of partial amino-acid sequences from clones that 
bind to Ag (Naik et al., 2002 Nature Materials 1: 169-172) 20 
includes: 


AYSSGAPPMPPF (SEQ ID NO : 28) 

, , 25 

NPSSLFYRLPSD (SEQ ID NO: 29) 

SLATQPPRTPPV (SEQ ID NO : 30) 

A list of partial amino-acid sequences from clones that 
bind to Au (Brown et al., 2000 J. Mol. Biol. 299: 725-735; 30 
Brown, S, 1997 Nature Biotechnol. 15: 269-272) includes: 

MHGKTQATSGTIQS (SEQ ID NO: 31) 

ALVPTAHRLDGNMH (SEQ ID NO: 32) 35 

Another advantage is that HSP60 nanotemplate structures 
such as rings, tubes, and filaments (to be described in detail 
below) bind to DNA and RNA using the method of “gel shift,” 
to proteins by the method of autoradiography, and to lipo- 40 
somes and lipid monolayers. 

Nanostructures, Nanoarrays, Nanodevices 

The modified polypeptides can self-assemble into chapero- 
nin structures which can be used as the basis for producing 
nanotemplates, nanostructures, nanoarrays and nanodevices. 

The term “nanotemplate” as used herein, unless otherwise 
indicated, refers to a composition comprising at least one 
modified chaperonin polypeptide 

The term “nanostructure” as used herein, unless otherwise 50 
indicated, refers to a composition comprising at least one 
modified chaperonin polypeptide and one or more nanoscale 
materials, such as nanoparticles and/or quantum dots. 

The term “nanoarray” as used herein, unless otherwise 
indicated, refers an ordered arrangement of at least one modi - 55 
fied chaperonin polypeptide and/or nanostructures. 

The term “nanodevice” as used herein, unless otherwise 
indicated, refers to a device comprising at least one modified 
chaperonin polypeptide, at least one nanostructure, and/or at 
least one nanoarray. Exemplary devices include, but are not 60 
limited to, electronic, semiconductor, mechanical, nanoelec- 
tromechanical, magnetic, photonic, optical, optoelectronic or 
biomedical devices. 

The term “nanounit” as used herein, unless otherwise indi- 
cated, refers any of the components or “basic building blocks” 65 
of a nanostructure, including, for example: a nanscale object, 
such as a nanoparticle or a quantum dot; a modified chapero- 
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nin polypeptide; and a wild-type chaperonin; a wild-type 
chaperonin polypeptide; a mutant chaperonin; or a chapero- 
nin polypeptide. 

In a specific embodiment, the nanotemplate can comprise a 
chaperonin structure comprising at least one modified chap- 
eronin polypeptide, which serve as “spacers” in the nanotem- 
plates. The spacer chaperonins can be confined to specific 
regions of the nanotemplate, and would not present specific 
binding sites for any of polypeptides, nano scale materials or 
linker molecules. The spacers can therefore serve a similar 
function as a mask in semiconductor fabrication. 

The generation of several different mutations of a given 
subunit can result in differences in dimension of the resulting 
chaperonins that comprise the nanotemplate. For example, a 
variant produced through the removal of a 28 amino acid loop 
at the apical end from of the (3 subunit of S. shibatae resulted 
in a chaperonin with an expanded internal pore diameter of 
from 2.5 nm to 9 nm (see FIGS. 7B-D). This can be exploited 
in forming a nanotemplate with different mixtures of chap- 
eronin subunit variants to present pores with different pore 
diameters for the binding of nanosale objects such as nano- 
particles and/or quantum dots. 

The chaperonins and/or nanotemplates can differ accord- 
ing to the types of subunits and also the combinations of types 
of subunits used in formation. For example, in vitro alpha and 
beta subunits of S. shibatae form homo -oligomeric rosetta- 
somes, while mixtures of alpha, beta, and gamma form het- 
ero-oligomeric rosettasomes. It has also been found that beta 
homo -oligomeric rosettasomes and all hetero-oligomeric 
rosettasomes of S. shibatae associate into filaments. FIG. 15 
shows the protein sequence alignment of S. shibatae TF55 
alpha subunit (SEQ ID NO: 39), beta subunit (SEQ ID NO: 1 ) 
and gamma subunit (SEQ ID NO: 38). In vivo rosettasomes 
are hetero-oligomeric with an average subunit-ratio of 1 a: 1 (3 : 
0.1 y in cultures grown at 75° C., a ratio of 1 a:3 13: ly in cultures 
grown at 60° C., and a ratio of 2a:3(3:Oy after 86° C. heat 
shock. Additionally, it has been observed that rosettasomes 
containing gamma were relatively less stable than those with 
alpha and/or beta subunits. A protein sequence alignment of 
the alpha, beta, gamma subunits of S. shibatae (see Figure), 
also provides useful information for positioning mutations on 
the chaperonin polypeptides. FIGS. 16A and 16B provide the 
DNA and amino-acid sequences of isolated^, shibatae TF55- 
Y- 

The isolated chaperonin polypeptide subunits from a given 
organism can assemble into different types of nanotemplates 
and other higher order structures (Kagawa et al., 2003 
Molecular Microbiology 48: 143-156). The isolated S. shiba- 
tae TF 55 alpha subunit (SEQ ID NO: 39) alone forms discrete 
homo -oligomeric rosettasomes with the characteristic nine- 
fold ring member symmetry, and arrays of rosettasomes. The 
isolated S. shibatae TF55 beta subunit (SEQ ID NO: 1) forms 
filaments of rosettasomes and bundles of filaments. The iso- 
lated S. shibatae TF55 gamma subunit (SEQ ID NO: 38) does 
not assemble into rosettasomes, but forms amorphous aggre- 
gates and non-uniform round objects, which were seen in the 
TEM (FIG. 6C). Varying the proportions of the different 
subunits from a given organism can also result in the assem- 
bly of different higher order structure being formed (Kagawa 
et al., 2003 Molecular Microbiology 48:143-156). A 1:1:1 
mixture of S. shibatae TF55 alpha, beta, and gamma subunits 
results in hetero-oligomeric rosettasomes and filaments that 
were less bundled than the ones formed from isolated beta 
subunits. The 1 : 1 mixture of S. shibatae TF55 alpha and beta 
subunits results in filaments that are indistinguishable from 
filaments formed by the 1:1:1 mixtures of alpha, beta and 
gamma. 
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In one embodiment, the higher order structures, such as the 
nanotemplates and nanostructures, comprise at least one iso- 
lated S. shibatae TF55 gamma subunit. This embodiment of 
the invention can comprise mutated or wild-type chaperonin 
polypeptides. In a specific embodiment, the higher order 
structures, comprise at least one isolated S. shibatae TF55 
gamma subunit and wild-type chaperonin polypeptides. 

In another embodiment, the nanotemplate forms part of a 
coating or a nanofabric. Due to the capability of the chaper- 
onins to self-assemble in an ordered arrangement on a fairly 
large length scale as compared to their pore diameters, they 
can be applied in these areas that could take advantage of the 
capability. Additionally, the resulting coating or nanofabric 
can be made to include optical, electric, magnetic, catalytic, 
or enzymatic moieties as functional units. These are produced 
through the selected placement of different nanoscale mate- 
rials the apical domain of the chaperonin, e.g., near the pores 
of the nanotemplates, or on other binding sites of the chap- 
eronin, or in between chaperoning. The inclusion of nanos- 
cale material with the nanotemplates is discussed further in 
the section on nanostructures. 

Changes in the subunit composition that can influence 
volume and reactivity of the central cavity of a chaperonin can 
also be exploited for various applications of the nanotem- 
plates. While not wishing to be limited to a particular theory 
or mechanism, it is noted that the N- and C-termini of chap- 
eronin subunits are believed to project into and occlude the 
central cavity. As such, because these termini can differ 
between subunits of a given species (e.g., rosettasome of S. 
shibatae ), changes in subunit composition of the chaperonin 
can be used to impact on the central cavity. Changes in the 
volume and binding properties of the central cavity of the 
chaperonin can therefore be dictated based on the composi- 
tion of the chaperonin, which can be exploited in the forma- 
tion of nanostructures which present different types of bind- 
ing sites for nano scale materials. In certain embodiments the 
N- and C-termini are deleted. 

The assembly of chaperonin polypeptides, for example 
HSP60s, into such structures as rings, tubes, filaments, and 
sheets (2-D crystals) can be regulated chemically. The assem- 
bly can be manipulated by, for example, the proportion of 
ATP/Mg 2+ and/or by manipulating the concentration of these 
regions. HSP60-rings, tubes, and filaments can, for example, 
function as nano-vessels if they are able to absorb, retain, 
protect and release gases or chemical reagents, including 
reagents of medical or pharmaceutical interest. On a nano- 
scale, the filamentous structures, preferably HSP60 struc- 
tures, are hollow and chemicals that are diffused or bound 
inside can be bound or released under programmed condi- 
tions at targeted locations. 

The structures, e.g., rings, tubes, and filaments, can be 
induced to form ordered structures on surfaces. Under con- 
trolled conditions the chaperonins are observed to form 2-di- 
mensional crystals on surfaces and the filament bundles may 
be oriented on surfaces. In an alternate embodiment, the 
nanotemplate functions as a multi-nanowell assay plate, or a 
single-molecule probe for DNA detection and hybridization. 

Layers of interwoven chaperonin filaments may form a 
nano-fabric. Such fabrics may be induced to form on lipid 
layers and may ultimately be used to coat surfaces of mate- 
rials. This may be of value in medical transplants in which the 
material could be coated with, e.g., an HSP60 (derived from 
the host organism) fabric from the host and thereby limit the 
immune response against the transplant. 

Fabrics or two-dimensional crystals of chaperonins com- 
prising HSP60 can form nano-arrays of DNA or RNA by 
taking advantage of the intrinsic affinity of HSP60s for 
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nucleic acids. Such arrays would represent an unprecedented 
density of DNA probes and thereby greatly amplify the den- 
sity of information per unit area. Other kinds of probes based 
on other molecules that associate with HSP60 can also be 
5 developed. 

For characterization, electron microscopy and electron 
probing methods (EDAX) can be used for investigating the 
contents of nano vessels, the continuity of nano-wires, the 
product of template experiments, and the nature of nano- 
10 fabrics. Atomic force microscope (AFM) can be used in 
imaging and analyzing features of these nanotemplates. The 
DNA nano-arrays can be tested by hybridization methods. 

The present invention provides methods for forming nano- 
structures. The chaperonins offer many advantages over other 
15 molecules for the controlled assembly of complex architec- 
tures, in their ability to self-assemble. A nanostructure can be 
formed from a selective placement process involving self- 
assembly, or directed binding, depending on the desired 
resulting architectural arrangement. The steps in the forma - 
20 tion of a nanostructure can include adding one or more 
nanounits comprising (i) at least one nanotemplate, (ii) at 
least one wild-type chaperonin, or (iii) a mixture of (i) and (ii) 
to a surface, and adding one or more nanounits comprising (i) 
at least one nanoparticle, (ii) at least one quantum dot, or (iii) 
25 a combination of (i) and (ii) to said surface. Any unbound 
nanounits are removed in order to maintain the desired archi- 
tecture. Each of the addition steps are repeated as many times 
as necessary to result in a nanostructure. Optimal conditions 
for assembly (i.e., concentration and proportion of Mg 2+ to 
30 ATP, ADP, AMP-PNP, GTP or ATPyS) are easily determined 
by routine experimentation, such as by changing each vari- 
able individually and monitoring formation of the appropriate 
products. In alternate embodiment, the nanostructures 
assemble in the absence of any of Mg 2+ , ATP, ADP, AMP- 
35 PNP, GTP or ATPyS. In yet other embodiments, assembly 
may require the presence of K + . 

The resulting nanostructures utilize proteins to control the 
assembly of structures that may, in certain embodiments, 
4Q incorporate organic materials or inorganic materials such as 
metallic, semiconducting or magnetic nanoparticles 
(Bruchez et al., 1998 Science 281 : 2013-16; Peng et al., 2000 
Nature 404(6773): 59-61; Whaley et al, 2000 Nature 405: 
665-68). 

45 For the formation of a nanostructure, nanoscale materials 
can be combined with the chaperonin polypeptides and/or 
chaperonins under suitable conditions (e.g, concentration 
and proportion of Mg 2+ , K + , ATP, ADP, AMP-PNP, GTP or 
ATPyS). The nanoscale material (i.e, the nanoparticle or 
50 quantum dot) can be attached to the chaperonin and/or the 
polypeptide subunits at specific binding sites prior to assem- 
bly of the nanostructure. The nanoscale materials can be 
introduced before the formation of the nanotemplates, e.g. , by 
being directly bound to a subunit, prior to assembly of the 
55 various subunits and/or chaperonins into the nanostructures. 
In an alternate embodiment, the nano scale material is 
attached to specific binding sites after the nanotemplate is 
assembled. In such an embodiment, a nanotemplate is first 
formed, with the selected sites for binding of the nanostruc- 
60 tures present on pre-determined locations of the nanotem- 
plates, and then the nanostructures are introduced. 

In another embodiment, the nanoparticles are coated with a 
coating that allows specific binding of the nanostructures to 
the pre-determined locations on the nanotemplates. FIG. 10 A 
65 shows a gold particle derivatized with surface-accessible, 
thiol -reactive maleimide groups (monomaleimido Nanogold, 
Nanoprobes, Inc.). The nanogold quantum dots were 
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covalently bound to the mutant beta subunit of S. shibatae 
with a cysteine presented as a binding site. 

In other embodiments, the nanoscale materials are coated 
with an inorganic and/or organic compounds, a polymer, a 
protein, a peptide, hormones, antibodies, nucleic acids, 
receptors, reactive chemical groups, binding agents and the 
like. For example, the nanoscale materials can be coated with 
a polyethylene glycol compound containing chemically reac- 
tive amine groups. 

In yet another embodiment, the nanoscale materials are 
coated with biotin or streptavidin. In a specific embodiment, 
the nano scale materials are coated with bovine serum albu- 
min (BSA) and biotin, and the streptavidin is located at one or 
more binding sites of the nanotemplate. In another example, 
amino acids, or small peptides are coated directly on the 
surface of the nanoscale materials, or are chemically linked to 
polymers or other type of macromolecules. 

Examples of nanoscale materials include, but are not lim- 
ited to, nanoparticles, such as gold, silver and other metal 
nanoparticle or composite nanoparticles of the metals; quan- 
tum dots (QD), including CdSe— ZnS, CdS, ZnS, CdSe, InP, 
InGaAs, CuCl, and InAs quantum dots, silicon nanocrystals 
and nanopyramids, silver nanoparticles; or magnetic quan- 
tum dots, e.g., nanomagnets, such as CoCu, FeCu, NiFe/Ag, 
and CoAg nanomagnets. The nanoscale materials can com- 
prise one or more materials, or combinations of materials, 
such as transition metals, including gold, silver, zinc, cad- 
mium, platinum, palladium, cobalt, mercury or nickel; alkali 
or alkaline earth metals, including sodium, potassium, cal- 
cium or cesuim; group III elements, including, aluminum, 
gallium or indium; group IV elements, including, silicon, 
germanium, tin or lead; group V elements, including, phos- 
phorous, arsenic, antimony, or bismuth; or group VI ele- 
ments, including, sulfur, selenium or tellurium. The listed 
materials can be in any given combination. Examples of III -V 
compounds include GaAs orAlGaAs. The nano scale material 
could also be a fullerene, a carbon nanotube, or a dielectric, 
polymeric, or semiconducting nanoparticle. In an alternate 
embodiment, flexible protein joints may be added to rigid 
carbon nanotubes to increase the diversity of possible forms 
while maintaining the functional features inherent in both 
kinds of nano -structures. 

The size of the nanoscale material can be about 0.5 nm, 1 
nm, about 10 nm, about 50 nm, about 100 nm, about 200 nm, 
or about 500 nm, or more. The size of the nanoparticles can 
depend on the location of the binding site on the nanotem- 
plate. If the binding site is at an apical domain, or within the 
internal cavity of the chaperonin, then the size of nanoscale 
material may correlate with the pore diameter of the chapero- 
nin to which it binds. FIGS. 7C and 7E show that the size of 
the nanoscale material that bind at the apical domain of chap- 
eronins formed from variants of the beta subunits of S. shi- 
batae. FIG. 7C shows an illustration of the 3-nm-pore 2D 
crystal (p312) indicating how 5 nm gold binds within the 
engineered pores. FIG. 7E shows an illustration of the 9-nm- 
pore 2D crystal (p312) indicating how 10 nm gold binds 
within the engineered pores. The nano scale materials may 
also be located in interstitial regions of the nanotemplate, i.e., 
between the chaperonins. The nanoscale materials may be 
bound to more than one chaperonin, such as when the nanos- 
cale material in present in an interstitial site. In another 
embodiment, the nanoscale material is located on top of a 
region of the nanotemplate, and serve as a type of “mask.” In 
this embodiment, the nanoscale material can range up to 500 
nm in size. 
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Morphologies of nanoparticles include, for example, nano- 
pillars, nanocrystals, nanorods, nanotubes, nanowires, 
nanofilaments, nanofibers and composite metal/dielectric 
nano shells. 

5 In a specific embodiment, application of an electric field is 
used to disrupt the nanostructure or the template which may 
be reformed on addition of other reagents with the removal of 
the electric field. 

In an alternate embodiment, differing amounts or propor- 
io tions of ATP, ADP, AMP-PNP, GTP or ATPyS are used to 
disrupt the nanostructure or nanotemplates, or to cause the 
nanoscale material to become unbound from the nanostruc- 
ture or nanotemplate. 

In an embodiment, amino acid tails that do not inhibit their 
15 ability to assemble into rings and tubes are attached to the 
chaperonin polypeptides, e.g., HSP60s, and that allow the 
binding of the nanoscale materials inside the chaperonins 
structure, at an apical, equitorial or intermediate domain, or 
on other locations of the chaperonin. 

20 Mutated chaperonin polypeptides, including HSP60s, can 
form nanometer or micron scale tubes and filaments or arrays 
containing metals or doped or undoped semiconductors, and 
could function as nano-wires, field-effect transistors, 
switches, diodes or logic devices. Given that metals can be 
25 attached to chaperonin polypeptides, their assembly into 
tubes would create a protein coated metal -cored conduit, i.e., 
a wire. By orienting and networking such wires nano-cir- 
cuitry can potentially be created, which may be of value in the 
computer industry. 

30 The nanostructures can also be incorporated into coatings 
with optical, electric, magnetic, catalytic, or enzymatic moi- 
eties as functional units. 

A nanoarray is a nanoscale or microscale ordered arrange- 
ment of nanotemplates and/or nanostructures. A nanoarray, 
35 therefore comprises an ordered array of nanostructures. A 
nanoarray can have any type of long range packing symmetry, 
including 2-, 3-, 4-, or 6-fold packing symmetry. The nanoar- 
ray can be a one-dimensional structure, a two-dimensional 
array, or a three-dimensional array. In a specific embodiment, 
40 where the nanoparticles are dielectrics, a three-dimensional 
nanoarray can be a photonic bandgap crystal. Optimal con- 
ditions for assembly and crystallization of a nanoarray (i.e., 
concentration and proportion of Mg 2+ to ATP, ADP, AMP- 
PNP, GTP or ATPyS) are easily determined by routine experi- 
45 mentation, such as by changing each variable individually 
and monitoring formation of the appropriate products. 

Assembly and higher order organization of mutant chap- 
eronin polypeptides has been previously described (PCT/ 
US02/35889) and can be used as a basis to select the optimal 
50 conditions for assembly and higher order organization using 
the modified chaperonin polypeptides of the present inven- 
tion. For example, in a previous study (PCT/US02/35889), 
two classes of beta mutants of S. shibatae are described 
having a single native cysteine residue changed to a nonreac- 
55 five alanine to prevent potential problems with folding and 
with assembly of mutant subunits. The cysteine is then placed 
at different solvent-exposed sites. The thiols of these cys- 
teines provide binding sites for soft metals including gold and 
zinc. In one class of beta mutants, the exposed cysteine was 
60 placed near the tip of a 28 amino acid loop on the apical 
domain of beta, which in the assembled chaperonin protrudes 
into the central cavity. FIG. 7A-E shows an example of the 
assembly of engineered HSP60s into nanoparticle array tem- 
plates of the preferred embodiment. This mutant chaperonin 
65 has a ring of reactive thiols with a diameter of approximately 
3 nm on both ends (FIG. 7 A, left). In the other class of beta 
mutants, the protruding 28 amino acid loop is removed and 
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placed the exposed cysteine on the apical domain itself. The 
mutant chaperonin assembled from this subunit has a ring of 
reactive thiols with a diameter of approximately 9 nm and an 
open pore into its central cavity (FIG. 7A, right). FIG. 7 A (top 
left) shows a model of a mutated HSP60 beta subunit indicat- 
ing apical loop cysteine placement by an arrow. The side view 
is consistent with both classes of chaperonin variants 
assembled from mutated beta subunits into two symmetri- 
cally stacked nine-fold rings (FIG. 7A, center), while FIG. 7A 
(bottom left) shows a top view of beta chaperonin variant 
revealing 3 nm pore ringed by nine cysteines. 

For the previously described mutants (PCT/US02/35889), 
the TEM image of a negatively stained 2D crystal of the beta 
chaperonin variant with cysteines substituted into the apical 
pores is shown in FIG. 7B. The two-sided plane group p312 
was assigned to the lattice through image analysis of micro- 
graphs of beta chaperonin 2D crystals from S. shibatae (Ko- 
eck et al., 1998 Biochim. Biophys. Acta 1429: 40-44). FIG. 
7A (top right). Result of genetic removal of the 28 residue 
apical loop of beta and substitution of cysteine at the site 
fusing the a-carbon backbone. Residue deletion choices were 
made based on the structural data from the model in FIG. 7A 
(left) as indicated by the arrows. FIG. 7B (bottom right) 
shows a top view of chaperonin variant with 9 nmpore ringed 
by cysteines. FIG. 7B shows the 2D crystal of 9-nm-pore 
variant detailing apparent increase in pore size by the change 
in electron density within the negatively stained rings. Both 
samples were imaged at the same condenser defocus setting. 
The ordering of the crystal is illustrated by the FFT of the 
image. FIG. 7C shows an illustration of the 3-nm-pore 2D 
crystal (p312) indicating how 5 nm gold binds within the 
engineered pores. FIG. 7E shows an illustration of the 9-nm- 
pore 2D crystal (p312) indicating how 10 nm gold binds 
within the engineered pores. 

As shown in a previous study (PCT/US0235889) the beta 
subunit S. shibatae proves to have sufficient structural plas- 
ticity in its apical domain to accommodate both the amino 
acid substitutions and deletions can be made without loss of 
its ability to form chaperonins and 2D crystals. Under reduc- 
ing conditions both classes of beta mutants formed chaper- 
onins that assembled into disk-shaped, hexagonally packed 
2D crystals up to 20 pm in diameter (PCT/US02/35889, and 
FIG. 7B, 7D). The order within the crystalline lattices is 
illustrated by fast Fourier transformation (FFT) of the TEM 
images (FIG. 7B, inset) which produced an optical diffracto- 
gram expressing the periodicity. 

In a previous study (PCT/US02/35889) to determine 
whether the thiol -containing 2D crystals of chaperonins acts 
as templates to bind and order nanoparticle QDs into arrays, 
commercially available gold nanoparticles (Ted Pella, Inc, 
Redding, Calif.) of different diameters were used (FIG. 8). 
FIG. 8 shows gold quantum dot binding to engineered chap- 
eronins and chaperonin templates. The uniform dispersion of 
these gold QDs in aqueous solution allows them to bind to 
hydrated chaperonin templates. To increase their likelihood 
of binding specifically to the reactive thiol of the cysteines, 
however, the nanoparticles can be passivated with the ligand 
bis(p-sulfonatophenyl)phenylphosphine (BSPP) (Loweth, C. 
J., Caldwell, W. B., Peng, X., Alivisatos, A. P. & Schultz, P. G. 
(1999) DNA-based assembly of gold nanocrystals. Angew. 
Chem. Int. Ed. 38: 1808-1812). BSPP displaces the citrate 
shell formed during synthesis of gold QDs (Novak, J. P., 
Nickerson, C., Franzen, S. & Feldheim, D. L. (2001) Purifi- 
cation of molecularly bridged metal nanoparticle arrays by 
centrifugation and size exclusion chromatography. Anal. 
Chem. 73:5758-5761) and thereby reduces nonspecific bind- 
ing of the QDs to the protein template. The passivated gold 
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QDs were reacted with the chaperonin templates attached to 
formvar-coated TEM grids (see Example 6.6) and imaged in 
TEM mode at 60 kV. At low magnifications the chaperonin 
2D crystals were visualized in the TEM using the electron 
5 density of the gold QDs themselves. FIG. 8A shows a low 
magnification TEM image of 10 pm diameter unstained 2D 
crystal of 9 nm chaperonin variant with 10 nm gold QDs 
bound. Contrast is from gold QDs bound to the crystalline 
lattice of the underlying protein template. Drying can cause 
to significant cracking and contributes to distortions and sepa- 
ration of regions of order within the array. FIG. 8B Higher- 
magnification stained TEM image of side views of 5 nm gold 
QDs tethered at the apical pores of the 3-nm-pore mutant 
chaperonins. At high magnification the chaperonin-gold 
1 5 interactions were vi sualized in the TEM by negative- staining 
samples with uranyl acetate. FIG. 8B (inset) shows a slab- 
view cutaway diagram of postulated orientation of 5 nm and 
10 nm gold QDs bound at the apical pores of the two chap- 
eronin variants. FIG. 8C shows a stained image of 5 nm gold 
20 QDs bound within the pores of the 3-nm-pore crystalline 
template. Occupied rings show the QDs (dark areas) sur- 
rounded and held in place by the outer protein density of the 
chaperonin pores. Empty rings have a brighter, less electron 
dense appearance. FIG. 8D shows ordered region of 10 nm 
25 gold bound to a 9-nm-pore template with similar area cover- 
age as in FIG. 8C. The protein holding the QDs in place is 
more difficult to see due to the larger size of the 10 nm QDs. 
Individual chaperonins in solution were observed to bind gold 
QDs on one or both ends. The QDs are presumably held in 
30 place by multiple dative bonds formed between the gold 
surface and the thiols within the pores (FIG. 8B). 

In a previous study (see PCT/US02/35889), in control 
experiments using chaperonin 2D crystals without exposed 
cysteines and with or without the amino acid loop deletions, 
35 the gold QDs appeared randomly distributed with no specific 
binding to the chaperonin crystals. On the surface of chap- 
eronin 2D crystals with cysteines, however, the gold QDs 
bound specifically onto the pores (FIG. 8C) forming regions 
of order on the protein (FIG. 8D) separated from one another 
40 by the cracked regions that resulted from drying, indicating 
that the engineered chaperonin crystals function as templates 
for gold QDs in solution. These chaperonin templates were 
size selective when attached to substrates and appeared to 
bind QDs only on the exposed side. Templates made from 
45 beta mutants with cysteines added to the apical loop that 
formed 3 nm rings of reactive thiols ordered 5 nm (±3 nm) 
gold QDs, but did not order 10 nm (±2 nm) or 1 5 nm (±1 nm) 
gold QDs, which bound randomly on the template surface. 
Variations in size distribution of gold QDs are a result of the 
50 manufacturer’s method of synthesis. The chaperonin tem- 
plates with the loop removed and cysteines on the apical 
domains that formed 9 nm rings of reactive thiols ordered 10 
nm (±2 nm) gold QDs, but 5 nm (±nm) and 15 nm (±1 nm) 
QDs bound randomly. This size selectivity is due to the acces- 
55 sibility and positioning of cysteine residues within the pores 
of the templates. 

As shown in a previous study (PCT/US02/35889), the pre- 
cision of the center-to -center spacing of gold QDs ordered by 
the chaperonin templates was 1 6 nm (±2 nm, n=200) for both 
60 5 and 10 nm gold QD arrays, as determined by TEM. This is 
consistent with the center-to-center spacing of the chaperonin 
pores in the underlying templates. The edge-to-edge spacing 
between QDs ranged from 6 to 10 nm for arrays made with 5 
nm (±3 nm) QDs bound to 3-nm-pore chaperonin templates 
65 and from 4 to 6 nm for arrays made with 1 0 nm (±2 nm) QDs 
bound to 9-nm-pore chaperonin templates. This variation in 
spacing can be attributed to both the variation in the size of the 
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gold QDs and to imperfections in the lattice of the chaperonin 
templates resulting from drying, cracking and dislocations 
within the arrays. The observed variation in QD spacing could 
be decreased with improved routes to QD synthesis having 
narrower size distributions. With more monodisperse QDs, 
the precision of center- to -center spacing in the gold nanoar- 
rays should make it possible to tune the physical properties of 
the arrays by controlling the interparticle coupling using dif- 
ferent sized QDs (Dujardin, et ah, 2002 Adv. Mater. 14: 
775-788). 

The chaperonin nanotemplate arrays can also bind and 
order semiconductor QDs to form nanoarrays. Quantum dots 
of size 4.5 mn luminescent core-shell (CdSe — ZnS QDs) 
were used (Dabbousi, B. O. et al. (1997) (CdSe)ZnS core- 
shell quantum dots: Synthesis and characterization of a size 
series of highly luminescent nanocrystallites. J. Phys. Chem. 
B 101 : 9463-9475). These QDs were reacted with 3-nm-pore 
chaperonin templates attached to glass or formvar substrates. 
Semiconductor QDs have low solubility in aqueous solutions. 
A QD suspension in trioctylphosphine/trioctylphosphine 
oxide (TOP/TOPO) diluted with butanol was reacted with 
dried chaperonin templates. Under these conditions the QDs 
bound to the cysteine-containing chaperonin templates (see 
PCT/US02/35889, and FIG. 9), but not appreciably to chap- 
eronin 2D crystals without exposed cyteines (FIGS. 12 and 
13). This is consistent with observations that Zn in the outer 
ZnS shell of CdSe — ZnS QDs binds solvent-exposed thiols 
(Chan, W. C. & Nie, S. (1998) Quantum dot bioconjugates for 
ultrasensitive nonisotopic detection. Science 281: 2016- 
2018). 

FIGS. 9A-D show the semiconductor QD nanoarray of a 
previously described mutant chaperonin polypeptide (PCT/ 
US02/35889). FIG. 9A shows differential interference con- 
trast (DIC) light micrograph of an 8 jam crystalline disc of 
3-nm-pore template with 4.5 mn luminescent CdSe — ZnS 
QDs bound. The differential interference contrast (DIC) 
image of the QD-bound template (FIG. 9A) and the corre- 
sponding fluorescent image reveal that QDs bound to cysteine 
thiol retain their luminescent properties (Bruchez, M., Jr., 
Moronne, M., Gin, P, Weiss, S. & Alivisatos, A. P. (1998) 
Semiconductor nanocrystals as fluorescent biological labels. 
Science 281 : 2013-2016). FIG. 9B shows both dry and rehy- 
drated discs fluoresced indicating the QDs bound to the sur- 
face of the template. Selectivity for cysteine is confirmed 
using 2D crystals of beta variant without added cysteines 
which showed minimal QD binding (supporting informa- 
tion), while FIG. 9C shows low magnification TEM of an 
unstained array of CdSe — ZnS QDs. Image contrast is due to 
the bound semiconductor QDs. The mottled appearance of 
both the QD luminescence and the electron density of low 
magnification TEM images indicate that the QDs are 
unevenly distributed on the chaperonin templates. FIG. 9D 
shows higher-magnification image of same crystal revealing 
an ordered region of QDs bound to the protein lattice. At 
higher magnification of unstained samples, regions of 
ordered QDs are visible. These regions are separated by unoc- 
cupied regions where QDs did not bind to the protein tem- 
plate. This difference could be due to drying or to solvent 
effects of the butanol, both of which may alter the structure of 
the chaperonin template and the accessibility of the thiols. 
Water-soluble (silica-capped) CdSe — ZnS (Gerion, D. et al., 
2001, “Synthesis and properties of biocompatible water- 
soluble silica-coated semiconductor nanocrystal s,” J. Phys. 
Chem. B 105: 8861-8871) QDs containing exposed thiol 
groups can bind more uniformly to hydrated chaperonin tem- 
plates. The thiols on these QDs, however, can cause them to 
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aggregate, which can result in the formation of defective 
arrays, in which case, it is preferable that the thiols be 
removed. 

Nanoscale materials can be maneuvered into nanoarrays 
5 and nanostructures by first tethering them to chaperonin sub- 
units and then ordered as the subunits assemble into chaper- 
onins and 2D crystals (nanoarrays) or other nanostructures. 
As an example, commercially available 1.4 mn gold QDs 
derivatized with surface-accessible, thiol-reactive maleimide 
10 groups can be used (monomaleimido Nanogold, Nanoprobes, 
Inc., Yaphank, N.Y.). FIGS. 10A-D show an embodiment of a 
nanogold nanoarray comprising a previously described 
mutant chaperonin polypeptide (PCT/US02/35889). FIG. 
10A shows a covalent attachment of 1 .4 mn monomaleimido 
15 Nanogold to subunits of loop -minus beta variant of the beta 
subunit of S. shibatae through Michael addition of cysteine 
thiol to QD surface maleimide groups. FIG. 10 A (right) 
shows possible arrangement of nine 1.4 nm covalently 
attached Nanogold QDs viewed at one end of a ring 
20 assembled from the derivatized subunit. FIG. 10B shows low 
magnification TEM image of a 2D crystalline array lightly 
stained with methylamine vanadate. The dark circular feature 
(arrow) demarks the analyzed area corresponding to the 
dashed-line spectrum in FIG. 10D and is the result of poly- 
25 merization of mobile hydrocarbon which is attracted to the 
beam periphery. FIG. 10C shows higher-magnification 
brightfield EF-TEM image of the array revealing the ordered 
pattern of electron density that extends across the crystalline 
template. FIG. 10D shows XEDS spectra of bare carbon film 
30 (solid line) and Nanogold array (dashed line) from the probe 
outlined in FIG. 10B. Characteristic X-ray peaks from gold 
(Au M a ~2 keV and Au L^-9.7 keV) confirm the presence of 
Nanogold within the array and the relative absence of Au on 
the support film. 

35 These Nanogold QDs were covalently bound to the mutant 
beta subunit with cysteine inserted in place of the 28 amino 
acid loop in the apical domain (see PCT/US02/35889, and 
FIGS. 10A-D). Subunits, with Nanogold attached, assembled 
into chaperonins in the presence of ATP/Mg 2 + (FIG. 10A); 
40 these chaperonins form 2D crystals (FIGS. 10B and 10C). 
The binding of the Nanogold QDs and localization within the 
pores of the chaperonin crystals was confirmed by analytical 
TEM (FIGS. 10 and 11A-11C). FIGS. 11A-11C show an 
HAADF STEM imaging of Nanogold array. FIGS. 11A-11C 
45 show the diameter of the features contributing to the array 
periodicity is consistent with multiple QDs localized within 
each ring. The diameter of electron density observed within 
the chaperonin rings forming the array (FIGS. 11A-11C) is 
approximately 8 to 12 times that observed for a single 1 .4 nm 
50 Nanogold QD (FIGS. 11A-11C). FIGS. 11A-11C show the 
periodicity from the Nanogold QDs localized within the rings 
extends across the entire crystal. 

In a previous study (PCT/US02/35889) ordered hexago- 
nally spaced inclusions within the crystalline template were 
55 observed and determined to contain gold by imaging methy- 
lamine vanadate stained Nanogold samples in brightfield 
Energy Filtering (EFTEM) mode and by using X-ray Energy 
Dispersive Spectroscopy (XEDS) (FIG. 10B-D). Oxygen 
plasma-treated carbon support films were used because they 
60 are more stable in an electron beam than formvar. Because the 
protein templates do not adhere to plasma-treated carbon as 
well as to formvar, samples were stained with methylamine 
vanadate to enable identification of their location on the sub- 
strate. The XEDS spectrum of the Nanogold array reveals 
65 distinct peaks due to gold that are well separated from vana- 
dium and copper peaks from the stain and carbon/copper 
support respectively (FIG. 10D). 
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High Angle Annular Dark Field (HAADF) Scanning/ 
Transmission Electron Microscopy (STEM) was also used in 
a previous study (PCT/US02/35889) to image the gold local- 
ized and ordered within the Nanogold arrays (FIGS. 11A-C). 
Comparisons of bare Nanogold to Nanogold ordered into an 
array revealed that multiple Nanogold QDs were localized 
within the pores of the crystallized chaperonins (FIGS. 11A 
and 11B). The HAADF image of the Nanogold crystal also 
confirms the presence of gold within the chaperonin pores 
because contrast in HAADF imaging mode is atomic number 
dependent, and nearly independent of focus or thickness. An 
HAADF comparison of the diameter of bare Nanogold par- 
ticles on carbon to the diameter of the gold nanoparticles 
contained within the central pores of the chaperonins that 
template the Nanogold into arrays reveals that the central 
diameters are approximately eight to twelve times that of the 
diameter of a single Nanogold QD. This observation is con- 
sistent with a model which suggests that each ring can contain 
up to nine Nanogold QDs (one per subunit). A lower magni- 
fication HAADF image of a similar area of an array reveals 
the ordering of the gold extends throughout the template 
(FIG. 11C). High resolution XEDS mapping attempts of the 
gold within the array were unsuccessful as the crystals were 
destroyed with the electron dose needed for such measure- 
ments. EELS (Electron Energy Loss Spectroscopy) mapping 
using the Au O shell was correspondingly unsuccessful 
because the V M shell edge lies in close proximity to the Au 
O shell and thus masks the gold signal. FIG. 12 shows a 
control experiment showing DIC (left) and fluorescent (right) 
images of non-cys-mutated chaperonin crystals after incuba- 
tion with CdSe — ZnS QDs. 

The luminescence intensity of the fluorescent image is 
barely visible indicating minimal QD binding. FIG. 13 shows 
an Energy Filtered TEM thickness map of a typical 2D pro- 
tein crystal. The intensity in this image is the ratio of the 
inelastic signal to the elastic signal and is proportional to the 
ratio of t fk where lambda is the mean free path for inelastic 
scattering and t is the local mass thickness. Regions of nomi- 
nally uniform intensity indicate regions of nominally con- 
stant mass thickness. Increasing intensity indicates increased 
thickness. At the various regions and at the edges of the 
crystal one can observe clear transitions indicating that the 
crystal is composed of several layers. 

In a previous study (PCT/US02/35889) crystal thickness 
measurements (AFM and TEM) suggest that these crystals 
can be multilayered (supporting information), and are 
observed as crystals ranging from 1 to 10 layers (approxi- 
mately 20 to 200 nm). The assembly of QDs into arrays by 
first covalently attaching them to subunits may create more 
defect-tolerant arrays because each chaperonin is composed 
of 1 8 subunits and therefore there are 1 8 chances for each site 
in the array to contain at least one QD. Likewise, the regions 
of QD ordering within arrays assembled this way appear to 
span the dimensions of the crystalline template and with 
fewer defects than previously observed. These types of arrays 
may find use in applications that demand longer range order- 
ing than the 5 and 1 0 nm gold and semiconductor nanoparticle 
binding protocols allow. 

The present invention provides a hybrid bio/inorganic 
approach to nanophase materials organization where the 
functionality of proteins can be rationally engineered. Using 
structural information and recombinant biotechnology tech- 
niques, genetically engineered chaperonins can be made to 
function both as nanotemplates and as vehicles for controlled 
nano scale organization of preformed QDs into ordered 
nanoarrays, e.g., arrays of nanomagnets. These nanotem- 
plates, nanostructures, and nanoarrays can be “wired” 
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together into functional nanodevices, for example by using 
genetics, as alternate binding sites may be engineered at 
different locations on the chaperonin. 

The possibility to induce asymmetry within the arrays by 
5 engineering alternate facets of the protein crystal is exploited 
in forming the nanodevices of the present invention. A nan- 
odevice comprises at least one nanotemplate, at least one 
nanostructure, at least one nanoarray or some combination 
thereof. A nanodevice can, for example, be an electronic, 
to semiconductor, mechanical, nanoelectromechanical, mag- 
netic, photonic, optical, optoelectronic or biomedical device 
formed from at least one nanostructure, at least one nanoar- 
ray, and/or at least one nanotemplate. 

In a specific embodiment, the nanostructures are organized 
15 into a nanodevice that functions with the chaperonins still 
present. In an alternate embodiment, the chaperonins are 
removed before the functioning of the nanodevice. The nan- 
otemplate and nanostructure provide an organizational basis 
for attached molecules, nanoparticles and quantum dots. The 
20 attached nano scale materials can be equally spacedat, e.g., 15 
nm intervals, or selectively place at pre-determined sites. 
Taking advantage of the fact that enzymes (such as proteases) 
can be used to specifically remove the chaperonin, the nan- 
otemplates can serve to leave behind pure material accurately 
25 placed on a surface at nano-scale resolution. 

The steps in the formation of a nanodevice are similar to 
those for forming a nanostructure, except that the building 
blocks are nanotemplates, nanostructures, and/or nanoarrays. 
The steps can include adding one or more nanotemplates, 
30 nanostructures, nanoarrays, or some combination thereof to a 
surface, and then removing any unbound nanotemplates, 
nanostructures, or nanoarrays. The steps are repeated any 
desired number of times, with the choice of material intro- 
duction being changed at each step to build the desired nan- 
35 odevice. Other masking techniques, e.g., semiconductor fab- 
rication can also be combined with the present invention in 
the construction of the nanodevice. 

There is no direct parallel of the present invention in the 
semiconductor manufacturing industry. The use of protein- 
40 based templates that self assemble into highly ordered struc- 
tures allow of the engineering of semiconductor materials on 
a size regime much smaller than that currently attainable. 
Further, given the diversity of the chaperonin system (e.g, its 
ability to bind other biomolecules such as lipid and DNA/ 
45 RNA) the compositions and devices of the invention can also 
be utilized in a biomedical, e.g., biomedical device, context. 

The invention further provides methods to selectively 
deposit nanoparticles or quantum dots in an ordered array 
onto inorganic substrates. DNA manipulation and genetic 
50 engineering of the genes that code for chaperonins can be 
used to generate specificity in molecular recognition at 
defined sites within the protein. For example, by introducing 
cysteine residues into the protein, it can specifically bind 
colloidal gold molecules through dative bonding between the 
55 sulfhydryl (SH) moiety of Cys and AuO. This allows for the 
organization of gold nanoparticles into ordered arrays onto 
substrates. After organizing the gold onto the surface, the 
protein can be removed using a reactive ion cold plasma, 
leaving the patterned gold in place on a clean surface (FIG. 
60 14), thereby producing a nanodevice of the invention. The 
HSP60s bound with proteins or peptides are capable of 
releasing the bound proteins or peptides in the presence of 
adenosine triphosphate (ATP) or acidic conditions (Udono 
and Srivastava, 1993 J. Exp. Med. 178:1391-1396). 

65 With advances in microbial genetics, for example using 
phage and cell surface display to identify inorganic binding 
peptide sequences (Whaley et al., Nature 405: 665-668), the 
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usefulness of this system extends beyond soft metals to other 
materials by, for example, the addition of sequences back into 
the loop region that was removed. 

Examples of additional, non-limiting applications of the 
nanodevices include field emitters, sensors, optoelectronic 
and all-optical switches, lenses, probes, lasers, nanoelectro- 
mechanical systems (NEMS), circuitry and nanoelectronics, 
nanomachines (e.g., by attaching nanomotors), neural net- 
works (nanoelectrodes for connections), nanocomputers, 
quantum computers, high-density magnetic memory or stor- 
age media, photonic crystals, nanocrystal antennas, multi- 
nanowell assay plates, nanocatalysts (e.g., palladium), nan- 
opores for single-molecule DNA sequencing, amplifiers for 
telecommunications (approximately 7 nm PbSe and PbS 
quantum dots have a tunable gap near 1 500 nm). Applications 
include, for example, memory or storage devices (e.g., hard- 
disk drive read heads, magnetic RAM), magnetic field sen- 
sors, magnetic logic devices, logic gates, and switches. 

Further applications can also include, for example, biochip 
applications. Quantum dots in a biochip, for example, can 
each account for at least one or several data bits. The position 
of a single electron in a quantum dot can attain several states, 
so that a quantum dot can represent a byte of data. In an 
alternate embodiment, a quantum dot can be used in more 
than one computational instruction at a time. 

Other applications of quantum dots include nanomachines, 
neural networks, and high-density memory or storage media. 

In an alternate embodiment, the nanodevice, nanotemplate 
or nanoarray functions as a single-molecule probe for DNA 
detection, hybridization, and sequencing. 

Polymer micro spheres with uniformly embedded poly- 
mers have applications as, for example, active fluorescent 
building blocks in flat panel displays and luminescent labels 
in biological detection. This application is achieved by form- 
ing a nanodevice comprising a nanoarray of embedded poly- 
mer nanoparticles 

Still further applications relate to molecular motors, e.g., 
molecular motors in a biomedical context. 

The following examples are presented to illustrate the 
present invention and to assist one of ordinary skill in making 
and using the same. The methodology and results may vary 
depending on the intended goal of treatment and the proce- 
dures employed. The examples are not intended in any way to 
otherwise limit the scope of the invention. 

Example 1 

The following provides a description of methods used to 
produce DNA sequences encoding the modified chaperonin 
polypeptides comprising relocated N-terminal and C-termi- 
nal ends, and encoding fusion proteins which are modified 
chaperonin polypeptide comprising relocate N- and C-teimi- 
nal ends where the carboxy-terminal end is joined with a 
sequence encoding yellow fluorescent protein. 

Materials and Methods 

DNA Constructs 

Genes encoding the permuted chaperonin proteins were 
created using a two-step polymerase chain reaction (PCR) 
method in which fragments of DNA before and after the 
permutation site are amplified with the flexible linked added 
to the beginning and end of the gene. The starting template 
DNA sequence encoding a TF55 beta subunit from Sulfolobus 
shibatae is shown in FIG. 28A. In the second step, the two 
fragments are annealed together at the flexible linker and the 
resulting template DNA was amplified by PCR. The DNA 
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encoding the permuted protein was then cloned into an E. coli 
expression vector (pETl 9b, Novagen) at the Nco I and Xho I 
restrictions sites. 

The resulting DNA sequences encoding the five different 
5 circular permutated TF55 polypeptides at positions 1 53, 267, 
316, 480, or 499 are shown in FIGS. 22-26, respectively. 
Models of the five different circular permutated TF55 
polypeptides in folded and assembled conformations are 
shown in FIGS. 17A-E. 

Protein Expression 

The expression vectors described above were transformed 
into E. coli BL21 DE3 cells containing a second vector that 
supplements rare tRNAs (BL21 DE3 CodonPlus RIL, Strat- 
15 agene). This second vector was used because E. coli codon 
usage differs from Sulfolobus. Some tRNAs that are common 
in Sulfolobus (and therefore in Sulfolobus genes) are rare in E. 
coli. These cells were then culture in Luria Broth with 100 
micrograms/liter ampicillin and 34 micrograms/liter 
20 chloramphenicol. IPTG was added to a final concentration of 
1 mM when the culture reached an optical density of 0.4 -0.8 
at 600 nm. After four hours the cells were harvested by 
centrifugation. The resulting pellet was weighed, resus- 
pended in 25 ml HEPES, 1 mM EDTA, pH 7.5 and stored at 
25 minus 80 degrees C. 

Protein Purification 

Cells were removed from storage at minus 80 degrees C. 
and thawed. Protease inhibitor cocktail (Sigma- Aldrich) was 
added during thawing at 0.25 ml per gram of wet pellet 
30 weight. When thawed, the cell suspension was placed on ice 
and sonicated 3 times for 2 minutes each with a one minute 
pause between sonications (Branson Sonifier 450, 60% duty 
cycle, output 6). The cell lysate was placed in a water bath at 
70 degrees C. water bath for 30 minutes to precipitate heat- 
35 labile proteins . After heating, the lysate was cooled on ice for 
10 minutes and then centrifuged at 17,000 g, at 4 degrees C., 
for 30 minutes. The supernatant was decanted and 2.5% (v/v) 
of a 5% (w/v) stock solution of polyethyleneimine (Sigma- 
Aldrich) at pH 7.8 was added to precipitate nucleic acids. 
40 After 10 minutes nice, the solution was again centrifuged at 
17,000 g, at 4 degrees C., for 30 minutes. The supernatant 
from this step was filtered through a 0.45 micrometer pore 
membrane and diluted to 50 ml with 25 mM HEPES, 1 mM 
EDTA, pH 7.5. This protein solution was loaded onto a 
45 MonoQ 10/10 anion exchange column (Amersham Bio- 
sciences) and eluted by a linear gradient from 1 to 400 mM 
NaCl in 25 mM HEPES, 1 mM EDTA, pH 7.5. Peak fractions 
were confirmed by denaturing polyacrylamide gel electro- 
phoresis, pooled and concentrated using Centriprep YM30 
50 centrifugal concentrators (Millipore). Buffer was exchanged 
in the centrifugal concentrator with at least 10 volumes of 25 
mM HEPES, 1 mM EDTA, pH 7.5. Protein concentration was 
determined by measuring absorbance at 280 nm using an 

estimated extinction coefficient of 56,820 M -1 cm -1 . 

55 

Inclusion Body Preparation. 

Cells were removed from storage at -80 degrees C. 
Reagents were added to a concentration of 25 mM HEPES, 
100 mM NaCl, 0.5% TritonX-100 and 0.25 ml Protease 
60 Inhibitor Cocktail (Sigma-Aldrich) per gram wet pellet 
weight before thawing. Cells were lysed by sonication, 3 
times for 2 minutes each with a 3 minute pause (Branson 
Sonifier 450, 40% duty cycle, output 5). Reagents were added 
to the cell lysate to a concentration of 10 mM MgS04, 1 
65 mg/ml Lysozyme (Sigma-Aldrich) and 1 microliter/10 ml 
Benzonase (Sigma-Aldrich) and incubated at room tempera- 
ture for 20 minutes before centrifugation at 6000xG and 4 
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degrees C. for 15 min. The pellet was resuspended in 30 ml 
Buffer R (50 mM Tris-HCl pH 8.0, 100 mM NaCl, 5 mM 
EDTA, 0.5% Triton X-100), then sonicated for 1 minute at 
previous settings. Reagents for 10 mM MgS04, 1 mg/ml 
Lysozyme and 1 microliterl/ 1 0 ml Benzonase were added, the 5 
lysate was incubated at room temperature for 20 minutes, 
then centrifuged at 6000xG, 4 degrees C., 10 min. The pellet 
was resuspended in 30 ml Buffer R, sonicated for 1 min and 
centrifuged 10 min. This last step was repeated (resuspended, 
sonicated, centrifuged). Next, the pellet was resuspended in 
30 ml 25 mM HEPES pH 8.0, 100 mM NaCl, 5 mM EDTA, 

0.1 NaN3, and centrifuged for 1 min at 6000xG, 4 degrees C. 
(This pellet was stored at -80° C. overnight). 

The pellet was solubilized in 500 microliter 100 mM Tris- 
HCl pH 7.5, 50 mM Glycine, then denatured in a solution of 
10 M Urea pH 7.5, 100 mM Tris, 50 mM Glycine to a final 
concentration of 8.5 M Urea. The protein was refolded by 
adding dropwise into 100 ml 50 mM Tris-HCl, 35 mM KC1, 

2 mM MgC12, 1 mM Beta mercaptoethanol pH 7.5. 

This protein solution was purified by ion exchange chro- 
matography as in the circular pennutants. 

Polyacrylamide Gel Electrophoresis 

The purity of the proteins was evaluated using denaturing 
polyacrylamide gel electrophoresis (GeneMate 8-16% 
Express Gels, ISC BioExpress). The ability of each of the 
circular permutations to form double rings was assessed by 
non-denaturing 4-40% gradient polyacrylamide gel (FIG. 18) 
as previously described (J K Trent, et al., 1991 Nature 354: 
490-493). 

Formation of Crystals and Filaments 

Crystal or filaments were formed from the circular permu- 
tant proteins by placing them in solution at concentrations of 
1 to 5 mg/ml in 25 mM HEPES at pH7.5 with 1 mM ATP and 
25 mM MgCl 2 . The samples were stored at room temperature 
for 1 to 3 days before examination for higher order structure 
by light and electron microscopy. 

Results 

Transmission Electron Microscopy 

Samples were prepared for the electron microscope by 
drop casting on carbon-coated formvar substrates (Ladd) and 
stained with 1% uranyl acetate. Samples were visualized 
using a Leo 912AB electron microscope in bright field TEM 
mode at 65 kV with a tungsten filament electron source. 

Samples of the five different circular pennutated chapero- 
nin polypeptides (without the joined EYFP sequence) were 
analyzed at 1 hour or 24 hours after addition of 1 mM ATP and 
25 mM Mg 2+ (FIG. 19A-J). All five circular permuted chap- 
eronins analyzed formed double rings having the same struc- 
ture as wild-type chaperonin double rings, with two rings of 
nine subunits each (FIG. 19A-E). The circular permutated 
chaperonin polypeptides 153, 267 and 316 formed predomi- 
nantly crystalline structures. The circular permutated chap- 
eronin polypeptides 480 and 499 formed predominantly fila- 
mentous structures. For permutants at positions 480 and 499, 
heating the sample to 70 degrees C. for 30 minutes immedi- 
ately after addition of ATP and Mg 2+ produced longer fila- 
ments. This same heating regime did not improve crystalli- 
zation of permutants 153, 267 or 316. 

The circular permutant EYFP fusion proteins -267 and 
-480 were analyzed via TEM. The 480 fusion protein 
assemble into chaperonin double rings having the same or 
similar form as wild-type chaperonins (FIG. 20B), and the 
267 fusion protein organizes into a crystalline structure (FIG. 
20D). 


Fluorescence Analyses 

The circular permutant fusion proteins (267 and 480) were 
analyzed using Leica filter cube 13 (excitation 470 nm, emis- 
sion 515 nm). The chaperonin-EYFP fusion proteins (circular 
permutants 267 and 480) exhibit fluorescence similar excita- 
tion and emission spectra of EYFP (FIG. 20A). Additionally, 
fluorescence microscopy shows the 267 circular permutant 
fusion protein organizes into a crystalline structure (FIG. 
20C). 

Discussion 

We present a general method for modifying protein tem- 
plates using circular permutation so that additional peptide 
sequences can be added in a wide variety of specific locations. 
Circular permutation is a reordering of the polypeptide chain 
such that the original N- and C-terminal ends are joined and 
new end termini are created elsewhere in the protein. New 
protein sequences can be joined to the relocated termini with- 
out perturbing the protein structure and with minimal limita- 
tion on the size and conformation of the added sequence. We 
have used circular permutation to modify a chaperonin 
HSP60 protein template (TF55 beta subunit), by relocating 
the terminal ends at five different locations distributed across 
the surface of the protein complex. 

Chaperonins are composed of 1 4, 1 6 or 1 8 subunits known 
as heat shock protein 60 (HSP60) (FU Hard and M Hayer- 
Hartl 2002 Science 295:1852-1858), where 60 refers to their 
molecular mass of approximately 60 kilodaltons. In the pres- 
ence of ATP and Mg , these subunits assemble into a two- 
stackedring structure. The modified chaperonin polypeptides 
of the present invention were based on an HSP60 from the 
hyperthermophilic archeon Sulfolobus shibatae , which lives 
in geothermal hot springs and thrives at pH 2 and tempera- 
tures up to 85 degrees C. Sulfolobus shibatae produces three 
related HSP60 proteins, designated alpha, beta and gamma. 
These three HSP60 proteins assemble in various stoichiom- 
etries, into octadecameric complexes with nine subunits per 
ring. These double ring structures are symmetric around a 
nine-fold axis and across a perpendicular two-fold axis. The 
surface adjacent to the interface between the rings is known as 
the equatorial surface and the surface of the ring most distant 
from the equator is the apical surface. Overall, the double 
rings are approximately 17 nm in diameter and 18 nm in 
length, and have a central pore. The Sulfolobus shibatae chap- 
eronins are known to form higher order structures such as 
filaments (J D Trent, et al., 1997 Proc Natl Acad Sci USA 
94:5383-5388) and two-dimensional crystals (J P Koeck, et 
al., 1998 Biochim Biophys Acta 1429:40-44; M J Ellis, et al., 
1998 J Struct Biol 123:30-36). The filaments are based on 
interactions between the apical surfaces on one double ring 
with the apical surfaces of two adjacent double rings. The two 
dimensional crystals are based on interactions between the 
equatorial surface of one double ring with equatorial surfaces 
of six adjacent double rings. Bundled filaments have also 
been observed and are believed to contain both types of inter- 
actions. 

We selected the HSP60 beta subunit (TF55, beta subunit) 
for our research because it forms homo-oligomeric rings that 
also assemble into filaments and two-dimensional crystals. 
We previously reported attaching nanoparticles to the chap- 
eronin double rings using thiol chemistry provided by a single 
cysteine residue substituted into the HSP60 protein sequence 
(PCT/US02/35889). We report here creating a chaperonin 
circular permutant by joining a second peptide sequences 
having arbitrary length and topology (yellow fluorescent pro- 
tein) onto the chaperonin protein sequence. The second pep- 
tide sequence is fused to the C-terminal end which is relo- 
cated from the central pore to a position on the exterior of the 
chaperonin polypeptide. 

Circular permutant proteins have been reported using other 
proteins. Published studies of circular permutations in other 
proteins generally conclude that, for proteins where the native 
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amino and carboxyl termini are near in space, many new 
locations for the termini are viable (P T Beemink, et ah, 2001 
Protein Sci 10:528-537; U Heinemann and M Hahn 1995 
Prog Biophys Mol Biol 64:121-143; M Iwakura, et ah, 2000 
Nat Struct Biol 7:580-585). In one study, dihydrofolate 5 
reductase was permuted at each of the 158 possible positions, 
resulting in 85 permutants that are competent to fold (M 
Iwakura, et ah, 2000 Nat Struct Biol 7:580-585). 

We selected five sites for circular permutation that are 
distributed across the exterior surface of the double ring, to 
These sites met the following criteria: (a) they are not buried; 

(b) they are not near the subunit interfaces; and (c) they are not 
part of regular secondary structure (alpha helices and beta 
sheets). In five different circular permutants, the new end 
termini are relocated after residues 153, 267, 316, 480 and 15 
499 in the native TF5 5 beta subunit from Sulfolobus shibatae. 
The original end termini are linked by a flexible peptide 
sequence (GGSGGT). Models of the permuted proteins are 
shown in FIG. 17. 

The circular permuted proteins were expressed at high 20 
levels in E. coli and were readily purified to better than 95% 
purity as determined by SDS-PAGE. The circular permuted 
proteins assemble into double rings with efficiency similar to 
wild-type protein, as demonstrated by non-denaturing gel 
electrophoresis (FIG. 1 8) . Examination by transmission elec- 25 
tron microscopy (TEM) indicates that the circular permuted 
chaperonin double rings have the same form as wild-type 
chaperonin double rings, with two rings of nine subunits each 
(FIG. 19A-E). 

We expected circular permutation to affect the assembly of 30 
the resulting chaperonins into higher order structures. Spe- 
cifically, perturbation of the peptide chain and addition of the 
new amino and carboxyl termini at different positions on the 
surface of the chaperonin double rings was expected to affect 
their propensity to form filamentous structures versus two- 35 
dimensional crystals. Changes to the apical surface of the 
chaperonin should destabilize the interactions necessary to 
form filaments, resulting in samples containing two-dimen- 
sional crystals with little or no filamentous material. On the 
other hand, changes on the equatorial surface of the chapero- 40 
nin should destabilize interactions that are necessary to form 
two-dimensional crystals resulting in predominantly filamen- 
tous structures. 

Examination of the assembled circular permutants using 
transmission electron microscopy confirms that permutations 45 
at positions 153, 267 or 316 results in predominantly crystal- 
line samples. Conversely, permutations at positions 480 or 
499 produce predominantly filamentous structures (FIGS. 
19F-J). For permutants at positions 480 and 499, heating the 
sample to 70 degrees C. for 30 minutes immediately after 50 
addition of ATP and Mg 2+ produced longer filaments. This 
same heating regime did not improve crystallization of per- 
mutants 153, 267 or 316. 
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The fusion proteins were created by joining the circular 
permutant 267 and 480 with a sequence encoding the yellow 
fluorescent protein. Yellow fluorescent protein (EYFP) is a 28 
kDa protein with a fluorescent excitation maximum at 5 1 2 nm 
and emission maximum at 529 nm. The fluorescent chro- 
mophore produced by EYFP is a result of cyclization and 
oxidation reactions, which require proper folding of the 
EYFP. The chaperonin-EYFP fusion proteins (circular per- 
mutants 267 and 480) exhibit fluorescence excitation and 
emission spectra typical of EYFP (FIG. 20A). This result 
indicates the EYFP assumed its correct structure to facilitate 
the reactions that produce the chromophore. Furthermore, the 
480 fusion protein assemble into chaperonin double rings 
having the same or similar form as wild-type chaperonins 
(FIG. 2 0B), and the 267 fusion protein organizes into a crys- 
talline structure (FIGS. 20C and D). These results indicate 
that fused EYFP is free in both fusion proteins to assume its 
most favorable conformation, and does not interfere with 
formation of double rings. Rings composed of the 267 fusion 
permutants readily form two-dimensional crystals of the 
same form as the 267 permutants lacking the fused EYFP. By 
contrast, the 480 fusion permutants do not form filaments 
under the same conditions as the 480 permutants lacking the 
fused EYFP. For example, filament formation by the 480 and 
499 permutants (lacking EYFP) is improved upon heating to 
70 degrees C. However, heating does not improve filament 
formation of the 480 fusion permutant because the heat 
unfolds the EYFP. 

The chaperonin circular permutants described above are 
competent to form the double-ring structures typical of wild- 
type chaperonin proteins. We also fused a fluorescent protein 
(EYFP) to two representative permutants (267 and 480), and 
demonstrate that the fluorescent protein folds into its active 
structure and does not interfere with assembly of chaperonin 
double-rings. 

The yellow fluorescent protein is a convenient reporter 
molecule. Other peptide sequences can also be used to pro- 
vide desirable properties to the chaperonin circular permu- 
tants. For example, peptide sequences having catalytic, 
chemical reactivity, or binding specificity for inorganic mate- 
rials are known. The peptide sequences can be a linear, con- 
strained loop, or a hilly structured protein. Some of these 
sequences have been discovered through studies of natural 
biomineralization (D E Morse, et al., 1 993 Mat Res Soc Symp 
Proc 292:59-67; M L Paine and M L Snead 1997 J Bone 
Miner Res 12:221-227; N Kroger, et ah, 1999 Science 286: 
1129-1132; J M Slocik, et ah, 2002 Nanoletters 2:169-173), 
while others have been isolated from large combinatorial 
libraries using phage display or bacterial cell-surface display 
(M Sarikaya, et ah, 2003 Nat Mater 2:577-585). These 
sequences serve as an interface between the highly versatile 
structural properties of proteins, and the optical an electronic 
properties of inorganic nanoparticles. 


SEQUENCE LISTING 


<16 0> NUMBER OF SEQ ID NOS: 91 

<210> SEQ ID NO 1 
<211> LENGTH: 552 
<212> TYPE: PRT 

<213> ORGANISM : Sulfolobus shibatae 
<22 0> FEATURE: 

<22 1> NAME/ KEY: MI SC_FEATURE 

<22 3 > OTHER INFORMATION: TF55 beta subunit 
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-continued 


<400 > SEQUENCE: 1 


Met 

Ala 

Thr 

Ala 

Thr 

Val 

Ala 

Thr 

Thr 

Pro 

Glu 

Gly 

lie 

Pro 

Val 

lie 

1 




5 





10 





15 


He 

Leu 

Lys 

Glu 

Gly 

Ser 

Ser 

Arg 

Thr 

Tyr 

Gly 

Lys 

Glu 

Ala 

Leu 

Arg 




20 





25 





30 



Ala 

Asn 

lie 

Ala 

Ala 

Val 

Lys 

Ala 

lie 

Glu 

Glu 

Ala 

Leu 

Lys 

Ser 

Thr 



35 





40 





45 




Tyr 

Gly 

Pro 

Arg 

Gly 

Met 

Asp 

Lys 

Met 

Phe 

Val 

Asp 

Ser 

Leu 

Gly 

Asp 


50 





55 





60 





lie 

Thr 

lie 

Thr 

Asn 

Asp 

Gly 

Ala 

Thr 

lie 

Leu 

Asp 

Lys 

Met 

Asp 

Leu 

65 





70 





75 





80 

Gin 

His 

Pro 

Thr 

Gly 

Lys 

Leu 

Leu 

Val 

Gin 

lie 

Ala 

Lys 

Gly 

Gin 

Asp 





85 





90 





95 


Glu 

Glu 

Thr 

Ala 

Asp 

Gly 

Thr 

Lys 

Thr 

Ala 

Val 

lie 

Leu 

Ala 

Gly 

Glu 




100 





105 





110 



Leu 

Ala 

Lys 

Lys 

Ala 

Glu 

Asp 

Leu 

Leu 

Tyr 

Lys 

Glu 

lie 

His 

Pro 

Thr 



115 





120 





125 




lie 

lie 

Val 

Ser 

Gly 

Tyr 

Lys 

Lys 

Ala 

Glu 

Glu 

lie 

Ala 

Leu 

Lys 

Thr 


130 





135 





140 





lie 

Gin 

Asp 

lie 

Ala 

Gin 

Pro 

Val 

Ser 

lie 

Asn 

Asp 

Thr 

Asp 

Val 

Leu 

145 





150 





155 





160 

Arg 

Lys 

Val 

Ala 

Leu 

Thr 

Ser 

Leu 

Gly 

Ser 

Lys 

Ala 

Val 

Ala 

Gly 

Ala 





165 





170 





175 


Arg 

Glu 

Tyr 

Leu 

Ala 

Asp 

Leu 

Val 

Val 

Lys 

Ala 

Val 

Ala 

Gin 

Val 

Ala 




180 





185 





190 



Glu 

Leu 

Arg 

Gly 

Asp 

Lys 

Trp 

Tyr 

Val 

Asp 

Leu 

Asp 

Asn 

Val 

Gin 

lie 



195 





200 





205 




Val 

Lys 

Lys 

His 

Gly 

Gly 

Ser 

lie 

Asn 

Asp 

Thr 

Gin 

Leu 

Val 

Tyr 

Gly 


210 





215 





220 





lie 

Val 

Val 

Asp 

Lys 

Glu 

Val 

Val 

His 

Pro 

Gly 

Met 

Pro 

Lys 

Arg 

lie 

225 





230 





235 





240 

Glu 

Asn 

Ala 

Lys 

lie 

Ala 

Leu 

Leu 

Asp 

Ala 

Ser 

Leu 

Glu 

Val 

Glu 

Lys 





245 





250 





255 


Pro 

Glu 

Leu 

Asp 

Ala 

Glu 

lie 

Arg 

lie 

Asn 

Asp 

Pro 

Thr 

Gin 

Met 

His 




260 





265 





270 



Lys 

Phe 

Leu 

Glu 

Glu 

Glu 

Glu 

Asn 

lie 

Leu 

Lys 

Glu 

Lys 

Val 

Asp 

Lys 



275 





280 





285 




lie 

Ala 

Ala 

Thr 

Gly 

Ala 

Asn 

Val 

Val 

lie 

Cys 

Gin 

Lys 

Gly 

lie 

Asp 


290 





295 





300 





Glu 

Val 

Ala 

Gin 

His 

Tyr 

Leu 

Ala 

Lys 

Lys 

Gly 

lie 

Leu 

Ala 

Val 

Arg 

305 





310 





315 





320 

Arg 

Ala 

Lys 

Lys 

Ser 

Asp 

Leu 

Glu 

Lys 

Leu 

Ala 

Arg 

Ala 

Thr 

Gly 

Gly 





325 





330 





335 


Arg 

Val 

lie 

Ser 

Asn 

lie 

Asp 

Glu 

Leu 

Thr 

Ser 

Gin 

Asp 

Leu 

Gly 

Tyr 




340 





345 





350 



Ala 

Ala 

Leu 

Val 

Glu 

Glu 

Arg 

Lys 

Val 

Gly 

Glu 

Asp 

Lys 

Met 

Val 

Phe 



355 





360 





365 




Val 

Glu 

Gly 

Ala 

Lys 

Asn 

Pro 

Lys 

Ser 

Val 

Ser 

lie 

Leu 

lie 

Arg 

Gly 


370 





375 





380 





Gly 

Leu 

Glu 

Arg 

Val 

Val 

Asp 

Glu 

Thr 

Glu 

Arg 

Ala 

Leu 

Arg 

Asp 

Ala 

385 





390 





395 





400 

Leu Gly 

Thr 

Val 

Ala 

Asp 

Val 

lie 

Arg 

Asp 

Gly 

Arg 

Ala 

Val 

Ala 

Gly 
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405 410 415 


Gly Gly 

Ala 

Val 

Glu 

lie 

Glu 

lie 

Ala 

Lys 

Arg 

Leu 

Arg 

Lys 

Tyr 

Ala 




420 





425 





430 



Pro 

Gin 

Val 

Gly 

Gly 

Lys 

Glu 

Gin 

Leu 

Ala 

lie 

Glu 

Ala 

Tyr 

Ala 

Asn 



435 





440 





445 




Ala 

He 

Glu 

Gly 

Leu 

lie 

Met 

lie 

Leu 

Ala 

Glu 

Asn 

Ala 

Gly 

Leu 

Asp 


450 





455 





460 





Pro 

lie 

Asp 

Lys 

Leu 

Met 

Gin 

Leu 

Arg 

Ser 

Leu 

His 

Glu 

Asn 

Glu 

Thr 

465 





470 





475 





480 

Asn 

Lys 

Trp 

Tyr 

Gly 

Leu 

Asn 

Leu 

Phe 

Thr 

Gly 

Asn 

Pro 

Glu 

Asp 

Met 





485 





490 





495 


Trp 

Lys 

Leu 

Gly 

Val 

lie 

Glu 

Pro 

Ala 

Leu 

Val 

Lys 

Met 

Asn 

Ala 

lie 




500 





505 





510 



Lys 

Ala 

Ala 

Thr 

Glu 

Ala 

Val 

Thr 

Leu 

Val 

Leu 

Arg 

lie 

Asp 

Asp 

lie 



515 





520 





525 




Val 

Ala 

Ala 

Gly 

Lys 

Lys 

Gly Gly 

Ser 

Glu 

Pro 

Gly 

Gly 

Lys 

Lys 

Glu 


530 





535 





540 





Lys 

Glu 

Glu 

Lys 

Ser 

Ser 

Glu 

Asp 









545 





550 












< 2 10 > SEQ ID NO 2 
< 2 11 > LENGTH: 548 
<212 > TYPE: PRT 

<213 > ORGANISM: Escherichia coli 
<220 > FEATURE: 

<221 > NAME /KEY: MIS C_FE ATURE 
<223 > OTHER INFORMATION: GroEL 


< 4 00 > SEQUENCE: 2 


Met 

Ala 

Ala 

Lys 

Asp 

Val 

Lys 

Phe 

Gly 

Asn 

Asp 

Ala 

Arg 

Val 

Lys 

Met 

1 




5 





10 





15 


Leu 

Arg 

Gly 

Val 

Asn 

Val 

Leu 

Ala 

Asp 

Ala 

Val 

Lys 

Val 

Thr 

Leu 

Gly 




20 





25 





30 



Pro 

Lys 

Gly 

Arg 

Asn 

Val 

Val 

Leu 

Asp 

Lys 

Ser 

Phe 

Gly 

Ala 

Pro 

Thr 



35 





40 





45 




lie 

Thr 

Lys 

Asp 

Gly 

Val 

Ser 

Val 

Ala 

Arg 

Glu 

lie 

Glu 

Leu 

Glu 

Asp 


50 





55 





60 





Lys 

Phe 

Glu 

Asn 

Met 

Gly 

Ala 

Gin 

Met 

Val 

Lys 

Glu 

Val 

Ala 

Ser 

Lys 

65 





70 





75 





80 

Ala 

Asn 

Asp 

Ala 

Ala 

Gly 

Asp 

Gly 

Thr 

Thr 

Thr 

Ala 

Thr 

Val 

Leu 

Ala 





85 





90 





95 


Gin 

Ala 

lie 

lie 

Thr 

Glu 

Gly 

Leu 

Lys 

Ala 

Val 

Ala 

Ala 

Gly 

Met 

Asn 




100 





105 





110 



Pro 

Met 

Asp 

Leu 

Lys 

Arg 

Gly 

lie 

Asp 

Lys 

Ala 

Val 

Thr 

Ala 

Ala 

Val 



115 





120 





125 




Glu 

Glu 

Leu 

Lys 

Ala 

Leu 

Ser 

Val 

Pro 

Cys 

Ser 

Asp 

Ser 

Lys 

Ala 

lie 


130 





135 





140 





Ala 

Gin 

Val 

Gly 

Thr 

lie 

Ser 

Ala 

Asn 

Ser 

Asp 

Glu 

Thr 

Val 

Gly 

Lys 

145 





150 





155 





160 

Leu 

lie 

Ala 

Glu 

Ala 

Met 

Asp 

Lys 

Val 

Gly 

Lys 

Glu 

Gly Val 

lie 

Thr 





165 





170 





175 


Val 

Glu 

Asp 

Gly 

Thr 

Gly 

Leu 

Gin Asp 

Glu 

Leu 

Asp 

Val 

Val 

Glu 

Gly 




180 





185 





190 




Met Gin Phe Asp Arg Gly Tyr Leu Ser Pro Tyr Phe lie Asn Lys Pro 
195 200 205 
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Glu 

Thr 

Gly 

Ala 

Val 

Glu 

Leu 

Glu 

Ser 

Pro 

Phe 

lie 

Leu 

Leu 

Ala 

Asp 


210 





215 





220 





Lys 

Lys 

lie 

Ser 

Asn 

lie 

Arg 

Glu 

Met 

Leu 

Pro 

Val 

Leu 

Glu 

Ala 

Val 

225 





230 





235 





240 

Ala 

Lys 

Ala 

Gly 

Lys 

Pro 

Leu 

Leu 

lie 

lie 

Ala 

Glu 

Asp 

Val 

Glu 

Gly 





245 





250 





255 


Glu 

Ala 

Leu 

Ala 

Thr 

Ala 

Val 

Val 

Asn 

Thr 

lie 

Arg 

Gly 

lie 

Val 

Lys 




260 





265 





270 



Val 

Ala 

Ala 

Val 

Lys 

Ala 

Pro 

Gly 

Phe 

Gly 

Asp Arg 

Arg 

Lys 

Ala 

Met 



275 





280 





285 




Leu 

Gin 

Asp 

lie 

Ala 

Thr 

Leu 

Thr 

Gly 

Gly 

Thr 

Val 

lie 

Ser 

Glu 

Glu 


290 





295 





300 





He 

Gly 

Met 

Glu 

Leu 

Glu 

Lys 

Ala 

Thr 

Leu 

Glu 

Asp 

Leu 

Gly 

Gin 

Ala 

305 





310 





315 





320 

Lys 

Arg 

Val 

Val 

lie 

Asn 

Lys 

Asp 

Thr 

Thr 

Thr 

lie 

lie 

Asp 

Gly 

Val 





325 





330 





335 


Gly 

Glu 

Glu 

Ala 

Ala 

lie 

Gin 

Gly 

Arg 

Val 

Ala 

Gin 

lie 

Arg 

Gin 

Gin 




340 





345 





350 



lie 

Glu 

Glu 

Ala 

Thr 

Ser 

Asp 

Tyr 

Asp 

Arg 

Glu 

Lys 

Leu 

Gin 

Glu 

Arg 



355 





360 





365 




Val 

Ala 

Lys 

Leu 

Ala 

Gly 

Gly 

Val 

Ala 

Val 

lie 

Lys 

Val 

Gly 

Ala 

Ala 


370 





375 





380 





Thr 

Glu 

Val 

Glu 

Met 

Lys 

Glu 

Lys 

Lys 

Ala 

Arg 

Val 

Glu 

Asp 

Ala 

Leu 

385 





390 





395 





400 

His 

Ala 

Thr 

Arg 

Ala 

Ala 

Val 

Glu 

Glu 

Gly 

Val 

Val 

Ala 

Gly 

Gly 

Gly 





405 





410 





415 


Val 

Ala 

Leu 

lie 

Arg 

Val 

Ala 

Ser 

Lys 

Leu 

Ala 

Asp 

Leu 

Arg 

Gly 

Gin 




420 





425 





430 



Asn 

Glu 

Asp 

Gin 

Asn 

Val 

Gly 

lie 

Lys 

Val 

Ala 

Leu 

Arg 

Ala 

Met 

Glu 



435 





440 





445 




Ala 

Pro 

Leu 

Arg 

Gin 

lie 

Val 

Leu 

Asn 

Cys 

Gly 

Glu 

Glu 

Pro 

Ser 

Val 


450 





455 





460 





Val 

Ala 

Asn 

Thr 

Val 

Lys 

Gly 

Gly 

Asp 

Gly 

Asn 

Tyr 

Gly 

Tyr 

Asn 

Ala 

465 





470 





475 





480 

Ala 

Thr 

Glu 

Glu 

Tyr 

Gly 

Asn 

Met 

lie 

Asp 

Met 

Gly 

lie 

Leu 

Asp 

Pro 





485 





490 





495 


Thr 

Lys 

Val 

Thr 

Arg 

Ser 

Ala 

Leu 

Gin 

Tyr 

Ala 

Ala 

Ser 

Val 

Ala 

Gly 




500 





505 





510 



Leu 

Met 

lie 

Thr 

Thr 

Glu 

Cys 

Met 

Val 

Thr 

Asp 

Leu 

Pro 

Lys 

Asn 

Asp 



515 





520 





525 




Ala 

Ala 

Asp 

Leu 

Gly 

Ala 

Ala 

Gly 

Gly 

Met 

Gly Gly 

Met 

Gly 

Gly 

Met 


530 





535 





540 






Gly Gly Met Met 
545 


<210 > SEQ ID NO 3 
<211> LENGTH: 543 
<212 > TYPE: PRT 

<213 > ORGANISM: Thermoplasma acidophilum 
<220 > FEATURE: 

<221> NAME /KEY: MIS C_FEATURE 

<223 > OTHER INFORMATION: beta subunit 

<400 > SEQUENCE: 3 

Met lie Ala Gly Gin Pro lie Phe lie Leu Lys Glu 
15 10 


Gly Thr Lys Arg 
15 
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Glu Ser 


He Ser 


Met Leu 
50 

Thr lie 
65 

Val Glu 


Thr Ala 


lie Asn 


Ala Ser 
130 

Gly Ala 
145 

Asn Ser 

Tyr Glu 

Val Asp 

Asp Asp 
210 

His Pro 
225 

Asp Ala 

lie Glu 

Met Leu 

Val lie 
290 

Arg Ala 
305 

Lys Leu 

lie Ser 

Val Gly 

Ala Val 
370 

Met Glu 
385 

Glu Asp 
Ala Phe 


Gly Lys 
20 

Asn Ser 
35 

Val Asp 

Leu Lys 

Val Ser 

Val lie 
100 

Gin Asn 
115 

Glu Glu 
Asp Glu 


Lys Ser 


Ala Val 
180 

Phe Asp 
195 

Thr Gin 


Gly Met 


Pro Leu 

Asp Pro 
260 

Arg Glu 
275 

Thr Gin 

Gly lie 

Ala Lys 

Ser Ser 
340 

Glu Asp 
355 

Ser lie 

Arg Ser 

Gly Ala 

Arg Leu 
420 


Asp Ala 

Val Arg 

Ser Leu 

Glu Met 
70 

Lys Thr 
85 

lie Ala 

Val His 

Ala Lys 

Lys Ala 
150 

Ala Ser 
165 

Lys Ser 

Asn lie 

Leu lie 

Pro Asp 
230 

Glu lie 
245 

Ser Met 

Met Val 

Lys Gly 

Tyr Ala 
310 

Ala Thr 
325 

Asp Leu 

Tyr Met 

Leu Val 

lie Thr 
390 

Tyr Ala 
405 

Arg Ser 


Met Lys 

Ser Ser 
40 

Gly Asp 
55 

Asp Val 

Gin Asp 

Gly Gly 

Pro Thr 
120 

Arg Val 
135 

Leu Leu 

Val Ala 

Val Ala 

Gin Val 
200 

Asn Gly 
215 

Val Val 

Lys Lys 

lie Gin 

Asp Lys 
280 

lie Asp 
295 

Val Arg 

Gly Ala 

Gly Thr 

Thr Phe 
360 

Arg Gly 
375 

Asp Ser 
Ala Gly 
Tyr Ala 


Glu Asn 
25 

Leu Gly 

lie Val 

Glu His 

Ser Phe 
90 

Leu Leu 
105 

Val lie 

lie Asp 

Leu Lys 

Lys Asp 
170 

Glu Leu 
185 

Val Lys 

lie lie 

Lys Asp 

Pro Glu 
250 

Lys Phe 
265 

lie Lys 

Asp Met 

Arg Val 

Ser lie 
330 

Ala Glu 
345 

Val Thr 

Glu Thr 

Leu His 

Gly Gly 
410 

Gin Lys 
425 


lie Glu 

Pro Arg 

lie Thr 
60 

Pro Ala 
75 

Val Gly 

Gin Gin 

Ser Glu 

Glu lie 
140 

Met Ala 
155 

Lys Leu 

Arg Asp 

Lys Gin 

Val Asp 
220 

Ala Lys 
235 

Phe Asp 
Leu Ala 


Ser Val 

Ala Gin 
300 

Lys Lys 
315 

Val Ser 

Arg Val 

Gly Cys 

Glu His 
380 

Val Val 
395 

Ala Thr 
lie Gly 


Ala 

Ala 

30 

Gly 

45 

Met 

Asn 

Asp 

Ala 

Lys 

Asp 

Gly 

Ala 

Gin 

110 

Gly 

125 

Tyr 

Ser 

Thr 

Gin 

Thr 

Ala 

Glu 

Gly 

Lys 

190 

Gly 

205 

Gly 

Lys 

Glu 

lie 

Ala 

Thr 

Asn 

Gin 

Glu 

270 

Gly 

285 

Ala 

His 

Tyr 

Ser 

Asp 

Thr 

lie 

Glu 

Gin 

350 

Lys 

365 

Asn 

Val 

Val 

Ala 

Ser 

Ala 

Ala 

Gly 

Arg 

430 


lie Ala 

Asp Lys 

Gly Val 

Met Met 
80 

Thr Thr 
95 

Gly Leu 

Arg Met 

Lys lie 

Ser Leu 
160 

lie Ser 
175 

Tyr Tyr 
Ala lie 


Lys Val 


Leu Leu 
240 

Leu Arg 
255 

Glu Asn 


Asn Val 

Leu Ser 

Met Asp 
320 

Asp Glu 
335 

Val Lys 

Pro Lys 

Asp Glu 

Ala Leu 
400 

Glu lie 
415 

Gin Gin 
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Leu 

Ala 

lie 

Glu 

Lys 

Phe 

Ala 

Asp 

Ala 

lie 

Glu 

Glu 

lie 

Pro 

Arg 

Ala 



435 





440 





445 




Leu 

Ala 

Glu 

Asn 

Ala 

Gly 

Leu 

Asp 

Pro 

lie 

Asp 

lie 

Leu 

Leu 

Lys 

Leu 


450 





455 





460 





Arg 

Ala 

Glu 

His 

Ala 

Lys 

Gly 

Asn 

Lys 

Thr 

Tyr 

Gly 

lie 

Asn 

Val 

Phe 

465 





470 





475 





480 

Thr 

Gly 

Glu 

lie 

Glu 

Asp 

Met 

Val 

Lys 

Asn 

Gly 

Val 

lie 

Glu 

Pro 

lie 





485 





490 





495 


Arg 

Val 

Gly 

Lys 

Gin 

Ala 

lie 

Glu 

Ser 

Ala 

Thr 

Glu 

Ala 

Ala 

lie 

Met 




500 





505 





510 



He 

Leu 

Arg 

lie 

Asp 

Asp 

Val 

lie 

Ala 

Thr 

Lys 

Ser 

Ser 

Ser 

Ser 

Ser 



515 





520 





525 




Ser 

Asn 

Pro 

Pro 

Lys 

Ser 

Gly 

Ser 

Ser 

Ser 

Glu 

Ser 

Ser 

Glu 

Asp 



530 535 540 


< 2 10 > SEQ ID NO 4 
< 2 11 > LENGTH: 544 
<212 > TYPE: PRT 

<213> ORGANISM: Cyanobacter ial synechococcus 
<220 > FEATURE: 

<221 > NAME /KEY: MIS C_FE ATURE 


<223 > OTHER 

INFORMATION 

: Cyanobacterial 

HSP60 





< 4 00 > SEQUENCE: 

4 












Met 

Ala 

Lys 

Arg 

lie 

lie 

Tyr 

Asn 

Glu 

Asn 

Ala 

Arg 

Arg 

Ala 

Leu 

Glu 

1 




5 





10 





15 


Lys 

Gly 

lie 

Asp 

lie 

Leu 

Ala 

Glu 

Ala 

Val 

Ala 

Val 

Thr 

Leu 

Gly 

Pro 




20 





25 





30 



Lys 

Gly 

Arg 

Asn 

Val 

Val 

Leu 

Glu 

Lys 

Lys 

Phe 

Gly 

Ala 

Pro 

Gin 

lie 



35 





40 





45 




lie 

Asn 

Asp 

Gly 

Val 

Thr 

lie 

Ala 

Lys 

Glu 

lie 

Glu 

Leu 

Glu 

Asp 

His 


50 





55 





60 





lie 

Glu 

Asn 

Thr 

Gly 

Val 

Ala 

Leu 

lie 

Arg 

Gin 

Ala 

Ala 

Ser 

Lys 

Thr 

65 





70 





75 





80 

Asn 

Asp 

Ala 

Ala 

Gly 

Asp 

Gly 

Thr 

Thr 

Thr 

Ala 

Thr 

Val 

Leu 

Ala 

His 





85 





90 





95 


Ala 

Val 

Val 

Lys 

Glu 

Gly 

Leu 

Arg 

Asn 

Val 

Ala 

Ala 

Gly 

Ala 

Asn 

Ala 




100 





105 





110 



lie 

Leu 

Leu 

Lys 

Arg 

Gly 

lie 

Asp 

Lys 

Ala 

Thr 

Asn 

Phe 

Leu 

Val 

Glu 



115 





120 





125 




Gin 

lie 

Lys 

Ser 

His 

Ala 

Arg 

Pro 

Val 

Glu 

Asp 

Ser 

Lys 

Ser 

lie 

Ala 


130 





135 





140 





Gin 

Val 

Gly 

Ala 

lie 

Ser 

Ala 

Gly 

Asn 

Asp 

Phe 

Glu 

Val 

Gly 

Gin 

Met 

145 





150 





155 





160 

lie 

Ala 

Asp 

Ala 

Met 

Asp 

Lys 

Val 

Gly 

Lys 

Glu 

Gly 

Val 

lie 

Ser 

Leu 





165 





170 





175 


Glu 

Glu 

Gly 

Lys 

Ser 

Met 

Thr 

Thr 

Glu 

Leu 

Glu 

Val 

Thr 

Glu 

Gly 

Met 




180 





185 





190 



Arg 

Phe 

Asp 

Lys 

Gly 

Tyr 

lie 

Ser 

Pro 

Tyr 

Phe 

Ala 

Thr 

Asp 

Thr 

Glu 



195 





200 





205 




Arg 

Met 

Glu 

Ala 

Val 

Phe 

Asp 

Glu 

Pro 

Phe 

lie 

Leu 

lie 

Thr 

Asp 

Lys 


210 





215 





220 





Lys 

lie 

Gly 

Leu 

Val 

Gin 

Asp 

Leu 

Val 

Pro 

Val 

Leu 

Glu 

Gin 

Val 

Ala 

225 





230 





235 





240 


Arg Ala Gly Arg Pro Leu Val lie lie Ala Glu Asp lie Glu Lys Glu 
245 250 255 
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Ala 

Leu 

Ala 

Thr 

Leu 

Val 

Val 

Asn 

Arg 

Leu 

Arg 

Gly 

Val 

Leu 

Asn 

Val 




260 





265 





270 



Ala 

Ala 

Val 

Lys 

Ala 

Pro 

Gly 

Phe 

Gly 

Asp 

Arg 

Arg 

Lys 

Ala 

Met 

Leu 



275 





280 





285 




Glu Asp 

lie 

Ala 

Val 

Leu 

Thr 

Gly 

Gly 

Gin 

Leu 

lie 

Thr 

Glu 

Asp 

Ala 


290 





295 





300 





Ala 

Arg 

Lys 

Leu 

Asp 

Thr 

Thr 

Lys 

Leu 

Asp 

Gin 

Leu 

Gly 

Lys 

Ala 

Arg 

305 





310 





315 





320 

Arg 

lie 

Thr 

lie 

Thr 

Lys 

Asp 

Asn 

Thr 

Thr 

lie 

Val 

Ala 

Glu 

Gly 

Asn 





325 





330 





335 


Glu 

Ala 

Ala 

Val 

Lys 

Ala 

Arg 

Val 

Asp 

Gin 

lie 

Arg 

Arg 

Gin 

lie 

Glu 




340 





345 





350 



Glu 

Thr 

Glu 

Ser 

Ser 

Tyr 

Asp 

Lys 

Glu 

Lys 

Leu 

Gin 

Glu 

Arg 

Leu 

Ala 



355 





360 





365 




Lys 

Leu 

Ser 

Gly 

Gly 

Val 

Ala 

Val 

Val 

Lys 

Val 

Gly 

Ala 

Ala 

Thr 

Glu 


370 





375 





380 





Thr 

Glu 

Met 

Lys 

Asp 

Arg 

Lys 

Leu 

Arg 

Leu 

Glu 

Asp 

Ala 

lie 

Asn 

Ala 

385 





390 





395 





400 

Thr 

Lys 

Ala 

Ala 

Val 

Glu 

Glu 

Gly 

lie 

Val 

Pro 

Gly 

Gly 

Gly 

Thr 

Thr 





405 





410 





415 


Leu 

Ala 

His 

Leu 

Ala 

Pro 

Gin 

Leu 

Glu 

Glu 

Trp 

Ala 

Thr 

Ala 

Asn 

Leu 




420 





425 





430 



Ser 

Gly 

Glu 

Glu 

Leu 

Thr 

Gly 

Ala 

Gin 

lie 

Val 

Ala 

Arg 

Ala 

Leu 

Thr 



435 





440 





445 




Ala 

Arg 

Leu 

Lys 

Arg 

lie 

Ala 

Glu 

Asn 

Ala 

Gly 

Leu 

Asn 

Gly 

Ala 

Val 


450 





455 





460 





He 

Ser 

Glu 

Arg 

Val 

Lys 

Glu 

Leu 

Pro 

Phe 

Asp 

Glu 

Gly 

Tyr 

Asp 

Ala 

465 





470 





475 





480 

Ser 

Asn 

Asn 

Gin 

Phe 

Val 

Asn 

Met 

Phe 

Thr 

Ala 

Gly 

lie 

Val 

Asp 

Pro 





485 





490 





495 


Ala 

Lys 

Val 

Thr 

Arg 

Ser 

Ala 

Leu 

Gin 

Asn 

Ala 

Ala 

Ser 

lie 

Ala 

Ala 




500 





505 





510 



Met 

Val 

Leu 

Thr 

Thr 

Glu 

Cys 

lie 

Val 

Val 

Asp 

Lys 

Pro 

Glu 

Pro 

Lys 



515 





520 





525 




Glu 

Lys 

Ala 

Pro 

Ala 

Gly 

Ala 

Gly 

Gly 

Gly 

Met 

Gly 

Asp 

Phe 

Asp 

Tyr 


530 





535 





540 





<210 > SEQ ID NO 

5 












<211> LENGTH: 535 












<212 > TYPE: 

PRT 













<213 > ORGANISM: 

Methanosarcina acetivorans 






<220 > FEATURE: 













<221> NAME/KEY: 

MIS C_FE ATURE 









<223 > OTHER 

INFORMATION: 

: acetivoran 

HSP60-4 






<400 > SEQUENCE: 

5 












Met 

Ala 

Ser 

Glu 

Leu 

Lys 

Thr 

Pro 

Gly 

Asn 

Thr 

Ser 

Pro 

Glu 

Ser 

Gin 

1 




5 





10 





15 


Asp 

Gly 

Met 

Ala 

Lys 

Leu 

Ala 

Arg 

Thr 

lie 

Arg 

Asp 

Lys 

lie 

Leu 

lie 




20 





25 





30 



Asp 

Glu 

Pro 

Val 

Lys 

Glu 

Glu 

Glu 

Leu 

lie 

Asp 

Gin 

Leu 

Glu 

Arg 

Ala 



35 





40 





45 




Ala 

lie 

Glu 

lie 

Asp 

Glu 

Leu 

Leu 

Gly 

Ser 

Ser 

Leu 

Gly 

Pro 

Lys 

Gly 


50 





55 





60 





Met 

Asn 

Lys 

lie 

lie 

Val 

Asn 

Pro 

Val 

Gly 

Asp 

lie 

Phe 

Val 

Thr 

Ser 
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-continued 


65 

Asp Gly 

Thr Ser 

Gly Thr 

Val Arg 
130 

Tyr Glu 
145 

Arg Gin 

Thr Gly 

Ala Leu 

Asp Leu 
210 

lie Val 
225 

Asp Met 


Asp Leu 


Met Asp 


Leu Cys 
290 

Leu Phe 
305 

Asp Ser 

Lys Leu 

lie Arg 

Lys Asn 
370 

Ala Thr 
385 

Glu Glu 

Asn Arg 

His Met 

Ala Val 
450 

Ala Arg 
465 


Lys Val 

Leu Lys 
100 

Lys Thr 
115 

Leu lie 

Leu Ala 

Ala Ser 

Lys Gly 
180 

Lys Val 
195 

Asn Arg 

Ala He 

Pro Lys 

Lys lie 
260 

Ser Val 
275 

Gly Glu 

Ser Glu 

Asn lie 

Ala Glu 
340 

Pro Cys 
355 

Gly Glu 

lie Leu 

Ala Val 

Glu lie 
420 

Val Arg 
435 

Gin Ala 
Asn lie 


70 

lie Leu 
85 

Lys Leu 

Ala Val 

Arg Ala 

Met Gin 
150 

Glu Glu 
165 

lie Glu 

lie Ser 

Asn Val 

Glu Gly 
230 

Ser Tyr 
245 

Lys Ser 

Gin Thr 

lie Ala 

Gly Asp 
310 

Leu Ala 
325 

Ala Thr 

Asp Leu 

Asn Phe 

lie Arg 
390 

Asp Asp 
405 

Val Asn 

Leu Phe 

Tyr Ala 

Gly Met 
470 


Lys 

Glu 

Ala 

Glu 

lie 

Phe 

120 

Gly 

135 

Val 

Lys 

Thr 

Asp 

lie 

Arg 

Gin 

His 

Leu 

200 

Lys 

215 

lie 

Leu 

lie 

Gin 

Asn 

Gly 

Tyr 

Ala 

Leu 

280 

Arg 

295 

Lys 

lie 

Asp 

Phe 

Lys 

Gly 

Thr 

Gly 

Arg 

360 

Val 

375 

Phe 

Glu 

Pro 

Ala 

Leu 

Gly 

Gly 

Ala 

Ala 

440 

Glu 

455 

Ala 

Asn 

Glu 


lie Asp 
90 

Ser Met 
105 

Ala Ser 

His Pro 

Tyr Glu 

Arg Thr 
170 

Gin Ala 
185 

Ser Glu 

Leu Lys 

Met Asp 

Pro Ala 
250 

Leu Asn 
265 

Leu Phe 

lie lie 

Pro Tyr 

Lys Leu 
330 

Thr Leu 
345 

Ala Gly 

lie Thr 

Val Lys 

Asn Asn 
410 

Gly Ala 
425 

Thr Gin 
Leu Glu 
lie Asp 


75 

Val Leu 

Asp Lys 

Asn Leu 

Thr lie 
140 

Met Leu 
155 

Thr lie 

Gin Ala 

Lys Gin 

Lys Lys 
220 

Glu Asn 
235 

Val Leu 

Pro Gin 

Glu Glu 

Asp Ser 
300 

lie Glu 
315 

Lys Met 

Met Ala 

Ser lie 

Val Lys 
380 

Tyr Gly 
395 

Ala Ala 

lie Glu 

Thr Gly 

Lys lie 
460 

Ala Met 
475 


His Pro 

Ala Cys 
110 

lie Lys 
125 

lie lie 

Gin Tyr 

Met Cys 

Val Thr 
190 

Ala Gly 
205 

Gly Gly 

Pro Ala 

lie Thr 

His Asn 
270 

Arg Lys 
285 

Gly Ala 

Thr Leu 

Lys Asp 

Gin Pro 
350 

Lys Leu 
365 

Asp Lys 

Leu Asp 

Phe Leu 

Phe Glu 
430 

Lys Arg 
445 

Pro Val 
Ala Gin 


80 

lie Val 
95 

Gly Asp 

Asn Ala 

Glu Gly 

Ser lie 
160 

Ser Ala 
175 

Glu lie 

Arg lie 

Pro Glu 

Arg Glu 
240 

Asn Tyr 
255 

Phe Lys 

Lys Gin 

Asn Val 

Leu Arg 
320 

Leu Glu 
335 

Asp Glu 

Glu Lys 

Ala lie 

Lys Val 
400 

Arg Lys 
415 

Leu Ala 

Gin Leu 

lie Leu 

Met Arg 
480 


Asn Ser Tyr Ala Arg Gly Leu Glu Ala Arg lie Asp Leu Ser Arg Lys 
485 490 495 
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Val 

Thr 

Asp 

Arg 

Gly 

Pro 

Glu 

Val 

Tyr 

Asp 

Ser 

Ala 

Thr 

Val 

Lys 

Lys 




500 





505 





510 



Leu 

Ala 

He 

lie 

Ala 

Gly 

Thr 

Glu 

Thr 

Ala 

Lys 

Lys 

Val 

Leu 

Arg 

lie 



515 





520 





525 




Asp 

Glu 

lie 

Val 

Pro 

Lys 

Lys 











530 535 


<210 > SEQ ID NO 6 
<211 > LENGTH: 540 
<212 > TYPE: PRT 

<213 > ORGANISM: Methanosarcina tuberculosis 
<220 > FEATURE: 

<221 > NAME /KEY: MIS C_FEATURE 

<223 > OTHER INFORMATION: tuberculosis HSP65 


< 4 00 > SEQUENCE: 6 


Met 

Ala 

Lys 

Thr 

lie 

Ala 

Tyr 

Asp 

Glu 

Glu 

Ala 

Arg 

Arg 

Gly 

Leu 

Glu 

1 




5 





10 





15 


Arg 

Gly 

Leu 

Asn 

Ala 

Leu 

Ala 

Asp 

Ala 

Val 

Lys 

Val 

Thr 

Leu 

Gly 

Pro 




20 





25 





30 



Lys 

Gly 

Arg 

Asn 

Val 

Val 

Leu 

Glu 

Lys 

Lys 

Trp 

Gly 

Ala 

Pro 

Thr 

lie 



35 





40 





45 




Thr 

Asn 

Asp 

Gly 

Val 

Ser 

lie 

Ala 

Lys 

Glu 

lie 

Glu 

Leu 

Glu 

Asp 

Pro 


50 





55 





60 





Tyr 

Glu 

Lys 

lie 

Gly 

Ala 

Glu 

Leu 

Val 

Lys 

Glu 

Val 

Ala 

Lys 

Lys 

Thr 

65 





70 





75 





80 

Asp 

Asp 

Val 

Ala 

Gly 

Asp 

Gly 

Thr 

Thr 

Thr 

Ala 

Thr 

Val 

Leu 

Ala 

Gin 





85 





90 





95 


Ala 

Leu 

Val 

Arg 

Glu 

Gly 

Leu 

Arg 

Asn 

Val 

Ala 

Ala 

Gly 

Ala 

Asn 

Pro 




100 





105 





110 



Leu 

Gly 

Leu 

Lys 

Arg 

Gly 

lie 

Glu 

Lys 

Ala 

Val 

Glu 

Lys 

Val 

Thr 

Glu 



115 





120 





125 




Thr 

Leu 

Leu 

Lys 

Gly 

Ala 

Lys 

Glu 

Val 

Glu 

Thr 

Lys 

Glu 

Gin 

lie 

Ala 


130 





135 





140 





Ala 

Thr 

Ala 

Ala 

lie 

Ser 

Ala 

Gly 

Asp 

Gin 

Ser 

lie 

Gly 

Asp 

Leu 

lie 

145 





150 





155 





160 

Ala 

Glu 

Ala 

Met 

Asp 

Lys 

Val 

Gly 

Asn 

Glu 

Gly 

Val 

lie 

Thr 

Val 

Glu 





165 





170 





175 


Glu 

Ser 

Asn 

Thr 

Phe 

Gly 

Leu 

Gin 

Leu 

Glu 

Leu 

Thr 

Glu 

Gly 

Met 

Arg 




180 





185 





190 



Phe 

Asp 

Lys 

Gly 

Tyr 

lie 

Ser 

Gly 

Tyr 

Phe 

Val 

Thr 

Asp 

Pro 

Glu 

Arg 



195 





200 





205 




Gin 

Glu 

Ala 

Val 

Leu 

Glu 

Asp 

Pro 

Tyr 

lie 

Leu 

Leu 

Val 

Ser 

Ser 

Lys 


210 





215 





220 





Val 

Ser 

Thr 

Val 

Lys 

Asp 

Leu 

Leu 

Pro 

Leu 

Leu 

Glu 

Lys 

Val 

lie 

Gly 

225 





230 





235 





240 

Ala 

Gly 

Lys 

Pro 

Leu 

Leu 

lie 

lie 

Ala 

Glu 

Asp 

Val 

Glu 

Gly 

Glu 

Ala 





245 





250 





255 


Leu 

Ser 

Thr 

Leu 

Val 

Val 

Asn 

Lys 

lie 

Arg 

Gly 

Thr 

Phe 

Lys 

Ser 

Val 




260 





265 





270 



Ala 

Val 

Lys 

Ala 

Pro 

Gly 

Phe 

Gly 

Asp 

Arg 

Arg 

Lys 

Ala 

Met 

Leu 

Gin 



275 





280 





285 




Asp 

Met 

Ala 

lie 

Leu 

Thr 

Gly 

Gly 

Gin 

Val 

lie 

Ser 

Glu 

Glu 

Val 

Gly 


290 





295 





300 






Leu Thr Leu Glu Asn Ala Asp Leu Ser Leu Leu Gly Lys Ala Arg Lys 
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305 310 315 320 


Val 

Val 

Val 

Thr 

Lys 

Asp 

Glu 

Thr 

Thr 

lie 

Val 

Glu 

Gly 

Ala 

Gly Asp 





325 





330 





335 


Thr 

Asp 

Ala 

lie 

Ala 

Gly 

Arg 

Val 

Ala 

Gin 

lie 

Arg 

Gin 

Glu 

lie 

Glu 




340 





345 





350 



Asn 

Ser 

Asp 

Ser 

Asp 

Tyr 

Asp 

Arg 

Glu 

Lys 

Leu 

Gin 

Glu 

Arg 

Leu 

Ala 



355 





360 





365 




Lys 

Leu 

Ala 

Gly 

Gly 

Val 

Ala 

Val 

lie 

Lys 

Ala 

Gly 

Ala 

Ala 

Thr 

Glu 


370 





375 





380 





Val 

Glu 

Leu 

Lys 

Glu 

Arg 

Lys 

His 

Arg 

lie 

Glu 

Asp 

Ala 

Val 

Arg 

Asn 

385 





390 





395 





400 

Ala 

Lys 

Ala 

Ala 

Val 

Glu 

Glu 

Gly 

lie 

Val 

Ala 

Gly 

Gly 

Gly 

Val 

Thr 





405 





410 





415 


Leu 

Leu 

Gin 

Ala 

Ala 

Pro 

Thr 

Leu 

Asp 

Glu 

Leu 

Lys 

Leu 

Glu 

Gly Asp 




420 





425 





430 



Glu 

Ala 

Thr 

Gly 

Ala 

Asn 

lie 

Val 

Lys 

Val 

Ala 

Leu 

Glu 

Ala 

Pro 

Leu 



435 





440 





445 




Lys 

Gin 

He 

Ala 

Phe 

Asn 

Ser 

Gly 

Leu 

Glu 

Pro 

Gly 

Val 

Val 

Ala 

Glu 


450 





455 





460 





Lys 

Val 

Arg 

Asn 

Leu 

Pro 

Ala 

Gly 

His 

Gly 

Leu 

Asn 

Ala 

Gin 

Thr 

Gly 

465 





470 





475 





480 

Val 

Tyr 

Glu 

Asp 

Leu 

Leu 

Ala 

Ala 

Gly 

Val 

Ala 

Asp 

Pro 

Val 

Lys 

Val 





485 





490 





495 


Thr 

Arg 

Ser 

Ala 

Leu 

Gin 

Asn 

Ala 

Ala 

Ser 

lie 

Ala 

Gly 

Leu 

Phe 

Leu 




500 





505 





510 



Thr 

Thr 

Glu 

Ala 

Val 

Val 

Ala 

Asp 

Lys 

Pro 

Glu 

Lys 

Glu 

Lys 

Ala 

Ser 



515 





520 





525 




Val 

Pro 

Gly 

Gly 

Gly 

Asp 

Met 

Gly 

Gly 

Met 

Asp 

Phe 






530 





535 





540 






<210 > SEQ ID NO 7 
<211> LENGTH: 554 
<212 > TYPE : PRT 

<213 > ORGANISM: Aeropyrum pernix 
<220 > FEATURE: 

<22 1> NAME /KEY: MIS C_FE ATURE 

<223 > OTHER INFORMATION: Alpha subunit 


< 4 00 > SEQUENCE: 7 


Met 

Ala 

Ala 

Thr 

Gly 

Tyr 

Pro 

Val 

Leu 

lie 

Leu 

Lys 

Glu 

Gly 

Thr 

Gin 

1 




5 





10 





15 


Arg 

Thr 

Tyr 

Gly 

Arg 

Glu 

Ala 

Leu 

Arg 

Ala 

Asn 

lie 

Leu 

Ala 

Ala 

Arg 




20 





25 





30 



Val 

Leu 

Ala 

Glu 

Met 

Leu 

Lys 

Ser 

Ser 

Leu Gly 

Pro 

Arg 

Gly 

Leu 

Asp 



35 





40 





45 




Lys 

Met 

Leu 

Val 

Asp 

Ala 

Phe 

Gly 

Asp 

lie 

Thr 

Val 

Thr 

Asn 

Asp 

Gly 


50 





55 





60 





Ala 

Thr 

lie 

Val 

Lys 

Glu 

Met 

Glu 

lie 

Gin 

His 

Pro 

Ala 

Ala 

Lys 

Leu 

65 





70 





75 





80 

Leu 

Val 

Glu 

Val 

Ala 

Lys 

Ala 

Gin 

Asp 

Ala 

Glu 

Val 

Gly 

Asp 

Gly 

Thr 





85 





90 





95 


Thr 

Ser 

Val 

Val 

Val 

Leu 

Ala 

Gly 

Ala 

Leu 

Leu 

Glu 

Lys 

Ala 

Glu 

Lys 




100 





105 





110 




Leu Leu Asp Glu Asn Leu His Pro Thr lie lie lie Glu Gly Tyr Thr 
115 120 125 
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Lys 

Ala 

Met 

Glu 

Glu 

Ala 

Leu 

Arg 

Leu 

Val 

Asp 

Glu 

Ala 

Ala 

Val 

Pro 


13 0 





135 





140 





Val 

Glu 

Val 

Glu 

Asp 

Asp 

Ser 

Val 

Leu 

Arg 

Arg 

lie 

Ala 

Glu 

Thr 

Thr 

145 





150 





155 





160 

Leu 

Ala 

Ser 

Lys 

Phe 

Val 

Gly 

Thr 

Gly 

Pro 

Glu 

Arg 

Asp 

Lys 

lie 

lie 





165 





170 





175 


Ser 

Met 

Val 

lie 

Asp 

Ala 

lie 

Arg 

Thr 

Val 

Ala 

Glu 

Lys 

Arg 

Pro 

Asp 




180 





185 





190 



Gly Gly 

Tyr 

Glu 

Val 

Asp 

Leu 

Asp 

Tyr 

Val 

Lys 

lie 

Glu 

Lys 

Lys 

Lys 



195 





200 





205 




Gly Gly 

Ser 

Leu 

Leu 

Asp 

Ser 

Lys 

Leu 

Val 

Arg 

Gly 

lie 

Val 

Leu 

Asp 


210 





215 





220 





Lys 

Glu 

Val 

Val 

His 

Pro 

Ala 

Met 

Pro 

Lys 

Arg 

Val 

Glu 

Asn 

Ala 

Lys 

225 





230 





235 





240 

He 

Leu 

Val 

Leu 

Asp 

Ala 

Pro 

Leu 

Glu 

Val 

Gin 

Lys 

Pro 

Glu 

Leu 

Thr 





245 





250 





255 


Thr 

Lys 

lie 

Arg 

Val 

Thr 

Asp 

lie 

Glu 

Lys 

Leu 

Glu 

Ser 

Phe 

Leu 

Glu 




260 





265 





270 



Glu 

Glu 

Thr 

Arg 

Met 

Leu 

Arg 

Asp 

Met 

Val 

Glu 

Lys 

lie 

Ala 

Ala 

Thr 



275 





280 





285 




Gly Ala 

Asn 

Val 

Val 

lie 

Thr 

Gin 

Lys 

Gly 

lie 

Asp 

Glu 

Val 

Ala 

Gin 


290 





295 





300 





His 

Phe 

Leu 

Ala 

Lys 

Lys 

Gly 

lie 

Leu 

Ala 

Val 

Arg 

Arg 

Val 

Lys 

Arg 

305 





310 





315 





320 

Ser 

Asp 

lie 

Glu 

Lys 

Val 

Ala 

Lys 

Ala 

Thr 

Gly 

Ala 

Lys 

lie 

Val 

Thr 





325 





330 





335 


Ser 

Leu 

Arg 

Asp 

Leu 

Lys 

Pro 

Glu 

Tyr 

Leu 

Gly 

Tyr 

Ala 

Glu 

Leu 

Val 




340 





345 





350 



Glu 

Glu 

Arg 

Lys 

Val 

Gly 

Glu 

Asp 

Lys 

Met 

Val 

Phe 

lie 

Glu 

Gly 

Ala 



355 





360 





365 




Lys 

Asn 

Pro 

Lys 

Ser 

Val 

Thr 

lie 

Leu 

Leu 

Arg 

Gly 

Ala 

Asn 

Asp 

Met 


370 





375 





380 





Leu 

Leu 

Asp 

Glu 

Ala 

Glu 

Arg 

Asn 

lie 

Lys 

Asp 

Ala 

Leu 

His 

Gly 

Leu 

385 





390 





395 





400 

Arg 

Asn 

lie 

Leu 

Arg 

Glu 

Pro 

Lys 

lie 

Val 

Gly 

Gly 

Gly 

Gly 

Ala 

Val 





405 





410 





415 


Glu 

Val 

Glu 

Leu 

Ala 

Leu 

Lys 

Leu 

Lys 

Glu 

Phe 

Ala 

Arg 

Thr 

Val 

Gly 




420 





425 





430 



Gly 

Lys 

Gin 

Gin 

Leu 

Ala 

lie 

Glu 

Ala 

Tyr 

Ala 

Glu 

Ala 

Leu 

Glu 

Thr 



435 





440 





445 




lie 

Pro 

Thr 

Val 

Leu 

Ala 

Glu 

Ser 

Ala 

Gly 

Met 

Asp 

Ala 

Leu 

Glu 

Ala 


450 





455 





460 





Leu 

Leu 

Lys 

Leu 

Arg 

Ser 

Leu 

His 

Ser 

Gin 

Gly 

Tyr 

Lys 

Phe 

Ala 

Gly 

465 





470 





475 





480 

Val 

Asn 

Val 

Leu 

Glu 

Gly 

Lys 

lie 

Glu 

Glu 

Asp 

Met 

Thr 

Lys 

lie 

Asn 





485 





490 





495 


Val 

Tyr 

Glu 

Pro 

Val 

Leu 

Val 

Lys 

Lys 

Gin 

Val 

lie 

Lys 

Ser 

Ala 

Ser 




500 





505 





510 



Glu 

Ala 

Ala 

lie 

Ser 

lie 

Leu 

Lys 

lie 

Asp 

Asp 

Val 

lie 

Ala 

Ala 

Ala 



515 





520 





525 




Pro 

Pro 

Lys 

Lys 

Lys 

Glu 

Lys 

Lys 

Gly 

Lys 

Thr 

Gly 

Glu 

Glu 

Glu 

Glu 


530 535 540 


Glu Glu Gly Gly Gly Ser Lys Phe Glu Phe 
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545 550 


<210 > SEQ ID NO 8 
<211> LENGTH: 542 
<212 > TYPE: PRT 

<213 > ORGANISM: Aeropyrum mazei 
<220 > FEATURE: 

<221 > NAME /KEY: MIS C_FEATURE 

<223 > OTHER INFORMATION: Alpha subunit 

< 4 00 > SEQUENCE: 8 


Met 

Ala 

Ala 

Gin 

Pro 

lie 

Phe 

lie 

Leu 

Arg 

Glu 

Gly 

Ser 

Lys 

Arg 

Thr 

1 




5 





10 





15 


His 

Gly 

Ser 

Asp 

Ala 

Gin 

His 

Asn 

Asn 

lie 

Met 

Ala 

Ala 

Lys 

Ala 

Val 




20 





25 





30 



Ala 

Glu 

Ala 

Val 

Arg 

Thr 

Thr 

Leu 

Gly 

Pro 

Lys 

Gly 

Met 

Asp 

Lys 

Met 



35 





40 





45 




Leu 

Val 

Asp 

Ala 

Met 

Gly 

Asp 

Val 

Val 

lie 

Thr 

Asn 

Asp 

Gly 

Ala 

Thr 


50 





55 





60 





He 

Leu 

Lys 

Glu 

Met 

Asp 

lie 

Glu 

His 

Pro 

Gly 

Ala 

Lys 

Met 

lie 

Val 

65 





70 





75 





80 

Glu 

Val 

Ala 

Lys 

Thr 

Gin 

Asp 

Ala 

Glu 

Val 

Gly 

Asp 

Gly 

Thr 

Thr 

Thr 





85 





90 





95 


Ala 

Ala 

Val 

Leu 

Ala 

Gly 

Glu 

Leu 

Leu 

Thr 

Lys 

Ala 

Glu 

Asp 

Leu 

Leu 




100 





105 





110 



Glu 

Ser 

Gly 

Val 

His 

Pro 

Thr 

Val 

lie 

Ala 

Ser 

Gly 

Tyr 

Arg 

Leu 

Ala 



115 





120 





125 




Ala 

lie 

Gin 

Ala 

Val 

Lys 

lie 

Leu 

Asp 

Thr 

lie 

Thr 

lie 

Ser 

Ala 

Ser 


130 





135 





140 





Pro 

Glu 

Asp 

Thr 

Glu 

Thr 

Leu 

Glu 

Lys 

lie 

Ala 

Gly 

Thr 

Ala 

lie 

Thr 

145 





150 





155 





160 

Gly 

Lys 

Gly 

Ala 

Glu 

Ser 

His 

Lys 

Ala 

His 

Leu 

Ser 

Asn 

Leu 

Ala 

Val 





165 





170 





175 


Arg 

Ala 

lie 

Lys 

Ser 

lie 

Val 

Glu 

Lys 

Asp 

Glu 

Asn 

Gly 

Lys 

lie 

Thr 




180 





185 





190 



Val 

Asp 

lie 

Glu 

Asp 

Val 

Lys 

Thr 

Glu 

Lys 

Arg 

Pro 

Gly 

Gly 

Ser 

lie 



195 





200 





205 




Lys 

Asp 

Ser 

Glu 

lie 

Val 

Glu 

Gly 

Val 

lie 

Val 

Asp 

Lys 

Glu 

Arg 

Val 


210 





215 





220 





His 

Thr 

Gly 

Met 

Pro 

Glu 

Val 

Val 

Lys 

Asp 

Ala 

Lys 

Val 

Leu 

Leu 

Leu 

225 





230 





235 





240 

Ser 

Val 

Pro 

lie 

Glu 

Leu 

Lys 

Lys 

Thr 

Glu 

Thr 

Lys 

Ala 

Glu 

lie 

Lys 





245 





250 





255 


lie 

Thr 

Thr 

Pro 

Asp 

Gin 

Met 

Gin 

Leu 

Phe 

Leu 

Asp 

Gin 

Glu 

Glu 

Ala 




260 





265 





270 



Met 

Leu 

Arg 

Glu 

lie 

Val 

Asp 

Lys 

Val 

lie 

Asp 

Thr 

Gly 

Ala 

Asn 

Val 



275 





280 





285 




Val 

Phe 

Cys 

Gin 

Lys 

Gly 

lie 

Asp 

Asp 

Leu 

Ala 

Gin 

Tyr 

Tyr 

Leu 

Thr 


290 





295 





300 





Lys 

Ala 

Gly 

lie 

Phe 

Ala 

Met 

Arg 

Arg 

Val 

Lys 

Lys 

Ser 

Asp 

Met 

Asp 

305 





310 





315 





320 

Lys 

Leu 

Ser 

Arg 

Ala 

Thr 

Gly 

Gly 

Arg 

lie 

lie 

Thr 

Asn 

Leu 

Asp 

Glu 


325 330 335 


lie Asp Glu Ser Asp Leu Gly Tyr Ala Gly Met Val Glu Glu Lys Asp 
340 345 350 
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Val 

Thr 

Gly 

Ser 

Arg 

Met 

Thr 

Phe 

Val 

Thr 

Gly 

Cys 

Lys 

Asp 

Ser 

Lys 



355 





360 





365 




Thr 

Thr 

Ser 

lie 

Leu 

Leu 

Arg 

Gly 

Gly 

Thr 

Glu 

His 

Val 

Val 

Asp 

Gly 


370 





375 





380 





Leu 

Glu 

Arg 

Ala 

Leu 

Glu 

Asp 

Ala 

Leu 

Arg 

Val 

Val 

Gly 

Val 

Ala 

Leu 

385 





390 





395 





400 

Glu Asp 

Gin 

Lys 

lie 

Val 

Val 

Gly 

Gly 

Gly 

Ser 

Pro 

Glu 

lie 

Glu 

Leu 





405 





410 





415 


Ser 

Leu 

Arg 

Leu 

Lys 

Glu 

Tyr 

Ala 

Ala 

Thr 

Leu 

Lys 

Gly 

Arg 

Glu 

Gin 




420 





425 





430 



Leu 

Ala 

Val 

Thr 

Lys 

Phe 

Ala 

Glu 

Ser 

Leu 

Glu 

Val 

lie 

Pro 

Gin 

Thr 



435 





440 





445 




Leu 

Ala 

Glu 

Asn 

Ala 

Gly 

Leu 

Asp 

Pro 

lie 

Asp 

Met 

Leu 

Val 

Glu 

Met 


450 





455 





460 





Arg 

Ser 

Gin 

His 

Glu 

Lys 

Gly 

Asn 

Lys 

Arg 

Ala 

Gly 

Leu 

Asn 

Val 

Tyr 

465 





470 





475 





480 

Lys 

Gly 

Lys 

lie 

Glu 

Asp 

Met 

Phe 

Glu 

Asn 

Asn 

Val 

Val 

Glu 

Pro 

Leu 





485 





490 





495 


Arg 

He 

Lys 

Thr 

Gin 

Ala 

lie 

Asn 

Ala 

Ala 

Thr 

Glu 

Ala 

Ala 

lie 

Met 




500 





505 





510 



Val 

Leu 

Arg 

lie 

Asp 

Asp 

Val 

lie 

Ala 

Ser 

Thr 

Gly 

Gly 

Gly Arg 

Ala 



515 





520 





525 




Ala 

Pro 

Gly 

Gly 

Met 

Pro 

Gly Gly 

Asp 

Met 

Glu 

Asp 

Met 

Met 




530 





535 





540 






< 2 10 > SEQ ID NO 9 
<211 > LENGTH: 577 
<212 > TYPE : PRT 

<213 > ORGANISM: Arabodopsis thaliana 
<220 > FEATURE: 

< 22 1 > NAME /KEY: MIS C_FE ATURE 

<223 > OTHER INFORMATION: Mitochondrial thaliana HSP60 


< 4 00 > SEQUENCE: 9 


Met 

Tyr 

Arg 

Phe 

Ala 

Ser 

Asn 

Leu 

Ala 

Ser 

Lys 

Ala 

Arg 

lie 

Ala 

Gin 

1 




5 





10 





15 


Asn 

Ala 

Arg 

Gin 

Val 

Ser 

Ser 

Arg 

Met 

Ser 

Trp 

Ser 

Arg 

Asn 

Tyr 

Ala 




20 





25 





30 



Ala 

Lys 

Glu 

lie 

Lys 

Phe 

Gly 

Val 

Glu 

Ala 

Arg 

Ala 

Leu 

Met 

Leu 

Lys 



35 





40 





45 




Gly Val 

Glu 

Asp 

Leu 

Ala 

Asp 

Ala 

Val 

Lys 

Val 

Thr 

Met 

Gly 

Pro 

Lys 


50 





55 





60 





Gly Arg 

Asn 

Val 

Val 

lie 

Glu 

Gin 

Ser 

Trp 

Gly 

Ala 

Pro 

Lys 

Val 

Thr 

65 





70 





75 





80 

Lys 

Asp 

Gly 

Val 

Thr 

Val 

Ala 

Lys 

Ser 

lie 

Glu 

Phe 

Lys 

Asp 

Lys 

lie 





85 





90 





95 


Lys 

Asn 

Val 

Gly 

Ala 

Ser 

Leu 

Val 

Lys 

Gin 

Val 

Ala 

Asn 

Ala 

Thr 

Asn 




100 





105 





110 



Asp 

Val 

Ala 

Gly 

Asp 

Gly 

Thr 

Thr 

Cys 

Ala 

Thr 

Val 

Leu 

Thr 

Arg 

Ala 



115 





120 





125 




lie 

Phe 

Ala 

Glu 

Gly 

Cys 

Lys 

Ser 

Val 

Ala 

Ala 

Gly 

Met 

Asn 

Ala 

Met 


13 0 





135 





140 





Asp 

Leu 

Arg 

Arg 

Gly 

lie 

Ser 

Met 

Ala 

Val 

Asp 

Ala 

Val 

Val 

Thr 

Asn 

145 





150 





155 





160 


Leu Lys Ser Lys Ala Arg Met lie Ser Thr Ser Glu Glu lie Ala Gin 
165 170 175 
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Val Gly 

Ala Lys 

Asp Gly 
210 

Leu Asp 
225 

Gin Lys 

lie Ser 

Arg Gin 

Leu Ala 
290 

Ala He 
305 

Asp Leu 
Met Asn 


Val Thr 


Lys Lys 
370 

Leu Ser 
385 

Lys Leu 


Ala Glu 


Thr Lys 

Leu Leu 
450 

Asp Gin 
465 

Val Tyr 

Gly Lys 

Lys Gly 

Lys Val 
530 

Leu Thr 
545 

Glu Ser 


Thr lie 
180 

Ala Met 
195 

Lys Thr 

Arg Gly 

Cys Glu 

Ser lie 
260 

Arg Pro 
275 

Thr Leu 

Lys Ala 

Ala Ala 

Leu Glu 
340 

Val Ser 
355 

Gly lie 

Thr Ser 

Ser Gly 

Val Gly 
420 

Ala Ala 
435 

Tyr Ala 

Lys lie 

Thr lie 

Leu Leu 
500 

Glu Tyr 
515 

lie Arg 
Thr Thr 
Gly Ala 


Ser Ala 

Glu Lys 

Leu Phe 

Tyr Thr 
230 

Leu Asp 
245 

Asn Ser 

Leu Leu 

lie Leu 

Pro Gly 
310 

Leu Thr 
325 

Lys Val 

Lys Asp 

Glu Glu 

Asp Tyr 
390 

Gly Val 
405 

Glu Lys 

Val Glu 

Ala Arg 

Gly Val 
470 

Ala Ser 
485 

Glu Gin 

Val Asp 

Thr Ala 

Glu Ala 
550 

Ala Gly 
565 


Asn Gly 

Val Gly 
200 

Asn Glu 
215 

Ser Pro 

Asp Pro 

lie Val 

lie Val 
280 

Asn Lys 
295 

Phe Gly 

Gly Gly 

Asp Leu 

Asp Thr 
360 

Arg Cys 
375 

Asp Lys 

Ala Val 

Lys Asp 

Glu Gly 
440 

Glu Leu 
455 

Gin lie 

Asn Ala 

Asp Asn 

Met Val 
520 

Leu Val 
535 

Val Val 
Ala Gly 


Glu Arg 
185 

Lys Glu 

Leu Glu 

Tyr Phe 

Leu lie 
250 

Lys Val 
265 

Ser Glu 

Leu Arg 

Glu Asn 

Glu Val 
330 

Ser Met 
345 

Val lie 

Glu Gin 

Glu Lys 

Leu Lys 
410 

Arg Val 
425 

lie Leu 

Glu Lys 

lie Gin 

Gly Val 
490 

Pro Asp 
505 

Lys Ala 

Asp Ala 

Val Asp 

Met Gly 
570 


Glu lie 

Gly Val 

Val Val 
220 

lie Thr 
235 

Leu lie 

Leu Glu 

Asp Val 

Ala Gly 
300 

Arg Lys 
315 

lie Thr 

Leu Gly 

Leu Asp 

lie Arg 
380 

Leu Gin 
395 

lie Gly 

Thr Asp 

Pro Gly 

Leu Pro 
460 

Asn Ala 
475 

Glu Gly 

Leu Gly 

Gly lie 

Ala Ser 
540 

Leu Pro 
555 

Gly Met 


Gly Glu 
190 

lie Thr 
205 

Glu Gly 

Asn Gin 

His Glu 

Leu Ala 
270 

Glu Ser 
285 

lie Lys 


Ala Asn 


Asp Glu 

Thr Cys 
350 

Gly Ala 
365 

Ser Ala 

Glu Arg 

Gly Ala 

Ala Leu 
430 

Gly Gly 
445 

Thr Ala 


Leu Lys 


Ala Val 


Tyr Asp 
510 

lie Asp 
525 

Val Ser 
Lys Asp 
Gly Gly 


Leu lie 

lie Gin 

Met Lys 

Lys Thr 
240 

Lys Lys 
255 

Leu Lys 

Asp Ala 

Val Cys 

Leu Gin 
320 

Leu Gly 
335 

Lys Lys 

Gly Asp 

lie Glu 

Leu Ala 
400 

Ser Glu 
415 

Asn Ala 
Val Ala 


Asn Phe 


Thr Pro 
480 

lie Val 
495 

Ala Ala 

Pro Leu 

Ser Leu 

Glu Ser 
560 

Met Asp 
575 


Tyr 
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<210 > SEQ ID NO 10 
<211> LENGTH: 559 
<212 > TYPE: PRT 

<213 > ORGANISM: Saccharomyces sp 
<220 > FEATURE: 

<221> NAME /KEY: MIS C_FEATURE 

<223 > OTHER INFORMATION: TCP1 alpha subunit 
< 4 00 > SEQUENCE: 10 


Met 

Ser 

Gin 

Leu 

Phe 

Asn 

Asn 

Ser 

Arg 

Ser 

Asp 

Thr 

Leu 

Phe 

Leu 

Gly 

1 




5 





10 





15 


Gly 

Glu 

Lys 

lie 

Ser 

Gly 

Asp 

Asp 

lie 

Arg 

Asn 

Gin 

Asn 

Val 

Leu 

Ala 




20 





25 





30 



Thr 

Met 

Ala 

Val 

Ala 

Asn 

Val 

Val 

Lys 

Ser 

Ser 

Leu 

Gly 

Pro 

Val 

Gly 



35 





40 





45 




Leu 

Asp 

Lys 

Met 

Leu 

Val 

Asp 

Asp 

lie 

Gly 

Asp 

Phe 

Thr 

Val 

Thr 

Asn 


50 





55 





60 





Asp 

Gly 

Ala 

Thr 

lie 

Leu 

Ser 

Leu 

Leu 

Asp 

Val 

Gin 

His 

Pro 

Ala 

Gly 

65 





70 





75 





80 

Lys 

lie 

Leu 

Val 

Glu 

Leu 

Ala 

Gin 

Gin 

Gin 

Asp 

Arg 

Glu 

lie 

Gly 

Asp 





85 





90 





95 


Gly 

Thr 

Thr 

Ser 

Val 

Val 

lie 

lie 

Ala 

Ser 

Glu 

Leu 

Leu 

Lys 

Arg 

Ala 




100 





105 





110 



Asn 

Glu 

Leu 

Val 

Lys 

Asn 

Lys 

lie 

His 

Pro 

Thr 

Thr 

lie 

lie 

Thr 

Gly 



115 





120 





125 




Phe 

Arg 

Val 

Ala 

Leu 

Arg 

Glu 

Ala 

lie 

Arg 

Phe 

lie 

Asn 

Glu 

Val 

Leu 


130 





135 





140 





Ser 

Thr 

Ser 

Val 

Asp 

Thr 

Leu 

Gly 

Lys 

Glu 

Thr 

Leu 

lie 

Asn 

lie 

Ala 

145 





150 





155 





160 

Lys 

Thr 

Ser 

Met 

Ser 

Ser 

Lys 

lie 

lie 

Gly 

Ala 

Asp 

Ser 

Asp 

Phe 

Phe 





165 





170 





175 


Ser 

Asn 

Met 

Val 

Val 

Asp 

Ala 

Leu 

Leu 

Ala 

Val 

Lys 

Thr 

Gin 

Asn 

Ser 




180 





185 





190 



Lys 

Gly 

Glu 

lie 

Lys 

Tyr 

Pro 

Val 

Lys 

Ala 

Val 

Asn 

Val 

Leu 

Lys 

Ala 



195 





200 





205 




His 

Gly 

Lys 

Ser 

Ala 

Thr 

Glu 

Ser 

Leu 

Leu 

Val 

Pro 

Gly 

Tyr 

Ala 

Leu 


210 





215 





220 





Asn 

Cys 

Thr 

Val 

Ala 

Ser 

Gin 

Ala 

Met 

Pro 

Lys 

Arg 

lie 

Ala 

Gly 

Gly 

225 





230 





235 





240 

Asn 

Val 

Lys 

lie 

Ala 

Cys 

Leu 

Asp 

Leu 

Asn 

Leu 

Gin 

Lys 

Ala 

Arg 

Met 





245 





250 





255 


Ala 

Met 

Gly 

Val 

Gin 

lie 

Asn 

lie 

Asp 

Asp 

Pro 

Glu 

Gin 

Leu 

Glu 

Gin 




260 





265 





270 



He 

Arg 

Lys 

Arg 

Glu 

Ala 

Gly 

lie 

Val 

Leu 

Glu 

Arg 

Val 

Lys 

Lys 

lie 



275 





280 





285 




lie 

Asp 

Ala 

Gly 

Ala 

Gin 

Val 

Val 

Leu 

Thr 

Thr 

Lys 

Gly 

lie 

Asp 

Asp 


290 





295 





300 





Leu 

Cys 

Leu 

Lys 

Glu 

Phe 

Val 

Glu 

Ala 

Lys 

lie 

Met 

Gly 

Val 

Arg 

Arg 

305 





310 





315 





320 

Cys 

Lys 

Lys 

Glu 

Asp 

Leu 

Arg 

Arg 

lie 

Ala 

Arg 

Ala 

Thr 

Gly 

Ala 

Thr 





325 





330 





335 


Leu 

Val 

Ser 

Ser 

Met 

Ser 

Asn 

Leu 

Glu 

Gly 

Glu 

Glu 

Thr 

Phe 

Glu 

Ser 




340 





345 





350 




Ser Tyr Leu Gly Leu Cys Asp Glu Val Val Gin Ala Lys Phe Ser Asp 
355 360 365 
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Asp 

Glu 

Cys 

lie 

Leu 

lie 

Lys 

Gly 

Thr 

Ser 

Lys 

His 

Ser 

Ser 

Ser 

Ser 


370 





375 





380 





He 

lie 

Leu 

Arg 

Gly 

Ala 

Asn Asp 

Tyr 

Ser 

Leu 

Asp 

Glu 

Met 

Glu 

Arg 

385 





390 





395 





400 

Ser 

Leu 

His 

Asp 

Ser 

Leu 

Ser 

Val 

Val 

Lys 

Arg 

Thr 

Leu 

Glu 

Ser 

Gly 





405 





410 





415 


Asn 

Val 

Val 

Pro 

Gly 

Gly 

Gly 

Cys 

Val 

Glu 

Ala 

Ala 

Leu 

Asn 

lie 

Tyr 




420 





425 





430 



Leu Asp 

Asn 

Phe 

Ala 

Thr 

Thr 

Val 

Gly 

Ser 

Arg 

Glu 

Gin 

Leu 

Ala 

lie 



435 





440 





445 




Ala 

Glu 

Phe 

Ala 

Ala 

Ala 

Leu 

Leu 

lie 

lie 

Pro 

Lys 

Thr 

Leu 

Ala 

Val 


450 





455 





460 





Asn 

Ala 

Ala 

Lys 

Asp 

Ser 

Ser 

Glu 

Leu 

Val 

Ala 

Lys 

Leu 

Arg 

Ser 

Tyr 

465 





470 





475 





480 

His 

Ala 

Ala 

Ser 

Gin 

Met 

Ala 

Lys 

Pro 

Glu 

Asp 

Val 

Lys 

Arg 

Arg 

Ser 





485 





490 





495 


Tyr 

Arg 

Asn 

Tyr 

Gly 

Leu 

Asp 

Leu 

lie 

Arg 

Gly 

Lys 

lie 

Val 

Asp 

Glu 




500 





505 





510 



lie 

His 

Ala 

Gly 

Val 

Leu 

Glu 

Pro 

Thr 

lie 

Ser 

Lys 

Val 

Lys 

Ser 

Leu 



515 





520 





525 




Lys 

Ser 

Ala 

Leu 

Glu 

Ala 

Cys 

Val 

Ala 

lie 

Leu 

Arg 

lie 

Asp 

Thr 

Met 


530 





535 





540 





lie 

Thr 

Val 

Asp 

Pro 

Glu 

Pro 

Pro 

Lys 

Glu 

Asp 

Pro 

His 

Asp 

His 



545 550 555 


< 2 10 > SEQ ID NO 11 

<211 > LENGTH: 485 

<212 > TYPE : PRT 

<213 > ORGANISM: Homo sapien 

<220 > FEATURE: 

< 22 1 > NAME /KEY: MIS C_FE ATURE 

<223 > OTHER INFORMATION: Human mitochondrial HSP60 


< 4 00 > SEQUENCE: 11 


Met 

Pro 

Ser 

Lys 

Lys 

Met 

Pro 

Gly 

Lys 

Ser 

Asn 

His 

Gly 

Lys 

Asn 

Asn 

1 




5 





10 





15 


Thr 

Phe 

Lys 

Leu 

Arg 

Ala 

Lys 

Phe 

Ser 

Phe 

Pro 

lie 

Leu 

Ala 

Ala 

Asp 




20 





25 





30 



Val 

Pro 

Ser 

Ala 

Phe 

Leu 

Tyr 

Gly 

Thr 

Ser 

His 

Ser 

Gly 

Gin 

Leu 

Ser 



35 





40 





45 




Leu 

Pro 

Gly 

Ala 

Lys 

Arg 

Ser 

Tyr 

Gly 

Gin 

Leu 

Pro 

Pro 

Ser 

Leu 

Ala 


50 





55 





60 





Leu 

Gin 

Asp 

Lys 

Tyr 

Lys 

Asn 

Thr 

Gly 

Ala 

Lys 

Leu 

Val 

Gin 

Asp 

Val 

65 





70 





75 





80 

Ala 

Asn 

Asn 

Thr 

Asn 

Glu 

Glu 

Ala 

Val 

Asp 

Gly 

Thr 

Thr 

Thr 

Val 

Thr 





85 





90 





95 


Ala 

Leu 

Ala 

Arg 

Ser 

lie 

Ala 

Lys 

Glu 

Gly 

Phe 

Glu 

Lys 

lie 

Ser 

Lys 




100 





105 





110 



Gly Ala 

Asn 

Pro 

Val 

Glu 

lie 

Arg 

Arg 

Gly 

Val 

Met 

Leu 

Ala 

Val 

Asp 



115 





120 





125 




Ala 

lie 

lie 

Ala 

Glu 

Pro 

Lys 

Lys 

Gin 

Ser 

Lys 

Pro 

Val 

Thr 

Thr 

Pro 


13 0 





135 





140 





Glu 

Glu 

lie 

Ala 

Arg 

Val 

Ala 

Thr 

lie 

Ser 

Ala 

Asn 

Gly 

Asp 

Lys 

Glu 

145 





150 





155 





160 


lie Gly Asn lie lie Ser Asp Ala Met Lys Lys Val Gly Ser Lys Gly 
165 170 175 
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He 

lie 

Thr 

Val 

Asn 

Asn 

Gly 

Lys 

Ser 

Gin 

Lys 

Cys 

Glu 

Phe 

Gin 

Asp 




180 





185 





190 



Ala 

Tyr 

Val 

Leu 

Leu 

Ser 

Glu 

Lys 

Lys 

lie 

Ser 

Ser 

Val 

Gin 

Ser 

lie 



195 





200 





205 




Ala 

Pro 

Ala 

Leu 

Glu 

lie 

Ala 

Asn 

Ala 

Tyr 

Ser 

Leu 

Val 

lie 

lie 

Ala 


210 





215 





220 





Glu Asp 

Val 

Asn 

Gly 

Glu 

Ala 

Leu 

Ser 

Thr 

Leu 

Val 

Leu 

Asn 

Arg 

Leu 

225 





230 





235 





240 

Lys 

Val 

Gly 

Leu 

Gin 

Val 

Val 

Ala 

Val 

Lys 

Asp 

Pro 

Gly 

Phe 

Gly Asp 





245 





250 





255 


Asn 

Arg 

Asn 

Asn 

Gin 

Leu 

Lys 

Asp 

Met 

Ala 

lie 

Ala 

Thr 

Gly 

Gly 

Ala 




260 





265 





270 



Val 

Phe 

Ala 

Glu 

Glu 

Gly 

Leu 

Thr 

Leu 

Asn 

Leu 

Glu 

Asp 

Val 

Gin 

Pro 



275 





280 





285 




His 

Asp 

Leu 

Gly 

Lys 

Val 

Gly 

Glu 

Val 

lie 

Val 

Thr 

Lys 

Asp 

Asp 

Ala 


290 





295 





300 





Met 

Leu 

Leu 

Lys 

Gly 

Lys 

Asp 

Gly Val 

Ala 

Val 

Leu 

Lys 

Val 

Gly 

Gly 

305 





310 





315 





320 

Thr 

Ser 

Asp 

Ala 

Glu 

Val 

Asn 

Glu 

Lys 

Gin 

Asp 

Arg 

Val 

Thr 

Asp 

Ala 





325 





330 





335 


Leu 

Asn 

Ala 

Thr 

Arg 

Ala 

Ala 

Val 

Glu 

Glu 

Gly 

lie 

Val 

Leu 

Arg 

Gly 




340 





345 





350 



Gly Arg 

Ala 

Leu 

Leu 

Arg 

Cys 

lie 

Pro 

Ala 

Leu 

Asp 

Ser 

Leu 

Thr 

Pro 



355 





360 





365 




Val 

Asn 

Glu 

Asp 

His 

Asn 

lie 

Gly 

lie 

Glu 

lie 

lie 

Lys 

Lys 

Thr 

Leu 


370 





375 





380 





Lys 

Phe 

Pro 

Ala 

Met 

Thr 

lie 

Ala 

Lys 

Asn 

Ala 

Gly 

Val 

Glu 

Val 

Ser 

385 





390 





395 





400 

Leu 

lie 

Val 

Glu 

Lys 

lie 

Met 

Gin 

Ser 

Ser 

Ser 

Glu 

Val 

Gly 

Tyr 

Asp 





405 





410 





415 


Ala 

Met 

Gly 

Arg 

Asp 

Phe 

Val 

Asn 

Met 

Val 

Glu 

Lys 

Gly 

lie 

lie 

Asp 




420 





425 





430 



Thr 

Thr 

Lys 

Phe 

Val 

Arg 

Thr 

Ala 

Leu 

Leu 

Asp 

Ala 

Ser 

Gly 

Val 

Ala 



435 





440 





445 




Ser 

Leu 

Leu 

Thr 

Thr 

Ala 

Glu 

Val 

Leu 

Val 

Thr 

Glu 

lie 

Pro 

Lys 

Glu 


450 





455 





460 





Glu 

Lys 

Asp 

Pro 

Gly 

Met 

Gly Ala 

Met 

Asp 

Gly 

Met 

Gly 

Gly 

Gly 

Met 

465 





470 





475 





480 

Gly Gly 

Gly 

Met 

Phe 













485 


<210 > SEQ ID NO 12 
<211 > LENGTH: 573 
<212 > TYPE: PRT 
<213 > ORGANISM: Mus sp 
<220 > FEATURE: 

<221> NAME /KEY: MIS C_FEATURE 

<223 > OTHER INFORMATION: Mouse mitochondrial HSP60 


<400 > SEQUENCE: 12 


Met 

1 

Leu Arg 

Leu 

Pro 

5 

Thr 

Val 

Leu 

Arg 

Gin 

10 

Met 

Arg 

Pro 

Val 

Ser 

15 

Arg 

Ala 

Leu Ala 

Pro 

20 

His 

Leu 

Thr 

Arg 

Ala 

25 

Tyr 

Ala 

Lys 

Asp 

Val 

30 

Lys 

Phe 


Gly Ala Asp Ala Arg Ala Leu Met Leu Gin Ala Val Asp Leu Leu Ala 
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Asp Ala 
50 

Glu Gin 
65 

Ala Lys 

Leu Val 

Thr Thr 

Glu Lys 
130 

Met Leu 
145 

Pro Val 

Asn Gly 

Val Gly 

Asp Glu 
210 

Ser Pro 
225 

Asp Ala 

lie Val 

lie He 

Asn Arg 
290 

Phe Gly 
305 

Gly Gly 

Val Gin 

Asp Asp 

Lys Arg 
370 

Tyr Glu 
385 

Val Ala 

Lys Lys 

Glu Glu 

Pro Ala 
450 


35 

Val Ala 

Ser Trp 

Ser lie 

Gin Asp 
100 

Thr Ser 
115 

lie Ser 

Ala Val 

Thr Thr 

Asp Lys 
180 

Arg Lys 
195 

Leu Glu 

Tyr Phe 

Tyr Val 

Pro Thr 
260 

Ala Glu 
275 

Leu Lys 

Asp Asn 

Ala Val 

Ala His 
340 

Ala Met 
355 

lie Gin 

Lys Glu 

Val Leu 

Asp Arg 
420 

Gly lie 
435 

Leu Asp 


Val Thr 

Gly Ser 
70 

Asp Leu 
85 

Val Ala 

Thr Val 

Lys Gly 

Asp Ala 
150 

Pro Glu 
165 

Asp lie 

Gly Val 

lie lie 

lie Asn 
230 

Leu Leu 
245 

Leu Glu 

Asp Val 

Val Gly 

Arg Lys 
310 

Phe Gly 
325 

Asp Leu 

Leu Leu 

Glu lie 

Lys Leu 
390 

Lys Val 
405 

Val Thr 
Val Leu 


Ser Leu 


40 

Met Gly 
55 

Pro Lys 


Lys Asp 


Asn Asn 

Leu Ala 
120 

Ala Asn 
135 

Val lie 

Glu lie 

Gly Asn 

lie Thr 
200 

Glu Gly 
215 

Thr Ser 

Ser Glu 

lie Ala 

Asp Gly 
280 

Leu Gin 
295 

Asn Gin 

Glu Glu 

Gly Lys 

Lys Gly 
360 

Thr Glu 
375 

Asn Glu 

Gly Gly 

Asp Ala 

Gly Gly 
440 

Lys Pro 
455 


Pro Lys 

Val Thr 

Lys Tyr 
90 

Thr Asn 
105 

Arg Ser 

Pro Val 

Ala Glu 

Ala Gin 
170 

lie lie 
185 

Val Lys 

Met Lys 

Lys Gly 

Lys Lys 
250 

Asn Ala 
265 

Glu Ala 

Val Val 

Leu Lys 

Gly Leu 
330 

Val Gly 
345 

Lys Gly 

Gin Leu 

Arg Leu 

Thr Ser 
410 

Leu Asn 
425 

Gly Cys 
Ala Asn 


Gly 

Arg 

60 

Lys 

75 

Asp 

Lys 

Asn 

Glu 

Glu 

lie 

Ala 

Glu 

lie 


140 

Leu 

155 

Lys 

Val 

Ala 

Ser 

Asp 

Asp 

Gly 

Phe 

Asp 

220 

Gin 

235 

Lys 

lie 

Ser 

His 

Arg 

Leu 

Ser 

Ala 

Val 


300 

Asp 

315 

Met 

Asn 

Leu 

Glu 

Val 

Asp 

Lys 

Asp 

lie 

380 

Ala 

395 

Lys 

Asp 

Val 

Ala 

Thr 

Ala 

Leu 

Glu 

Asp 

460 


45 

Thr Val 

Gly Val 

lie Gly 

Ala Gly 
110 

Lys Glu 
125 

Arg Arg 

Lys Gin 

Thr lie 

Ala Met 
190 

Lys Thr 
205 

Arg Gly 

Cys Glu 

Ser Val 

Lys Pro 
270 

Thr Met 
285 

Lys Ala 

Ala lie 

Asn Leu 

lie Val 
350 

Ala His 
365 

Thr Thr 

Leu Ser 

Glu Val 

Arg Ala 
430 

Leu Arg 
445 

Gin Lys 


lie lie 

Thr Val 
80 

Ala Lys 
95 

Asp Gly 

Gly Phe 

Gly Val 

Ser Lys 
160 

Ser Ala 
175 

Lys Lys 

Leu Asn 

Tyr lie 

Phe Gin 
240 

Gin Ser 
255 

Leu Val 

Val Leu 

Pro Gly 

Ala Thr 
320 

Glu Asp 
335 

Thr Lys 

lie Glu 

Ser Glu 

Asp Gly 
400 

Asn Glu 
415 

Ala Val 
Cys lie 
lie Gly 




89 


US 7,795,388 B2 


-continued 


He 

Glu 

lie 

lie 

Lys 

Arg 

Ala 

Leu 

Lys 

lie 

Pro 

Ala 

Met 

Thr 

lie 

Ala 

465 





470 





475 





480 

Lys 

Asn 

Ala 

Gly 

Val 

Glu 

Gly 

Ser 

Leu 

lie 

Val 

Glu 

Lys 

lie 

Leu 

Gin 





485 





490 





495 


Ser 

Ser 

Ser 

Glu 

Val 

Gly 

Tyr Asp 

Ala 

Arg 

Leu 

Gly 

Asp 

Phe 

Val 

Asn 




500 





505 





510 



Met 

Val 

Glu 

Lys 

Gly 

lie 

lie 

Asp 

Pro 

Thr 

Lys 

Val 

Val 

Arg 

Thr 

Ala 



515 





520 





525 




Leu 

Leu 

Asp 

Ala 

Ala 

Gly 

Val 

Ala 

Ser 

Leu 

Leu 

Thr 

Thr 

Ala 

Glu 

Ala 


530 





535 





540 





Val 

Val 

Thr 

Glu 

lie 

Pro 

Lys 

Glu 

Glu 

Lys 

Asp 

Pro 

Gly 

Met 

Gly 

Ala 

545 





550 





555 





560 

Met 

Gly 

Gly 

Met 

Gly 

Gly 

Gly 

Met 

Gly 

Gly 

Ser 

Met 

Leu 





565 570 


< 2 10 > SEQ ID NO 13 

< 2 11 > LENGTH: 556 

<212 > TYPE: PRT 

<213> ORGANISM: Homo sapien 

<220 > FEATURE: 

<221 > NAME /KEY: MIS C_FE ATURE 

<223 > OTHER INFORMATION: Human TCP1 


< 4 00 > SEQUENCE: 13 


Met 

Glu 

Gly 

Pro 

Leu 

Ser 

Val 

Phe 

Gly 

Asp 

Arg 

Ser 

Thr 

Gly 

Glu 

Thr 

1 




5 





10 





15 


lie 

Arg 

Ser 

Gin 

Asn 

Val 

Met 

Ala 

Ala 

Ala 

Ser 

lie 

Ala 

Asn 

lie 

Val 




20 





25 





30 



Lys 

Ser 

Ser 

Leu 

Gly 

Pro 

Val 

Gly 

Leu 

Asp 

Lys 

Met 

Leu 

Val 

Asp 

Asp 



35 





40 





45 




lie 

Gly 

Asp 

Val 

Thr 

lie 

Thr 

Asn 

Asp 

Gly 

Ala 

Thr 

lie 

Leu 

Lys 

Leu 


50 





55 





60 





Leu 

Glu 

Val 

Glu 

His 

Pro 

Ala 

Ala 

Lys 

Val 

Leu 

Cys 

Glu 

Leu 

Ala 

Asp 

65 





70 





75 





80 

Leu 

Gin 

Asp 

Lys 

Glu 

Val 

Gly 

Asp 

Gly 

Thr 

Thr 

Ser 

Val 

Val 

lie 

lie 





85 





90 





95 


Ala 

Ala 

Glu 

Leu 

Leu 

Lys 

Asn 

Ala 

Asp 

Glu 

Leu 

Val 

Lys 

Gin 

Lys 

lie 




100 





105 





110 



His 

Pro 

Thr 

Ser 

Val 

lie 

Ser 

Gly 

Tyr 

Arg 

Leu 

Ala 

Cys 

Lys 

Glu 

Ala 



115 





120 





125 




Val 

Arg 

Tyr 

lie 

Asn 

Glu 

Asn 

Leu 

lie 

Val 

Asn 

Thr 

Asp 

Glu 

Leu 

Gly 


130 





135 





140 





Arg Asp 

Cys 

Leu 

lie 

Asn 

Ala 

Ala 

Lys 

Thr 

Ser 

Met 

Ser 

Ser 

Lys 

lie 

145 





150 





155 





160 

lie 

Gly 

lie 

Asn 

Gly 

Asp 

Phe 

Phe 

Ala 

Asn 

Met 

Val 

Val 

Asp 

Ala 

Val 





165 





170 





175 


Leu 

Ala 

lie 

Lys 

Tyr 

Thr 

Asp 

lie 

Arg 

Gly 

Gin 

Pro 

Arg 

Tyr 

Pro 

Val 




180 





185 





190 



Asn 

Ser 

Val 

Asn 

lie 

Leu 

Lys 

Ala 

His 

Gly 

Arg 

Ser 

Gin 

Met 

Glu 

Ser 



195 





200 





205 




Met 

Leu 

lie 

Ser 

Gly 

Tyr 

Ala 

Leu 

Asn 

Cys 

Val 

Val 

Gly 

Ser 

Gin 

Gly 


210 





215 





220 





Met 

Pro 

Lys 

Arg 

lie 

Val 

Asn 

Ala 

Lys 

lie 

Ala 

Cys 

Leu 

Asp 

Phe 

Ser 

225 





230 





235 





240 


Leu Gin Lys Thr Lys Met Lys Leu Gly Val Gin Val Val lie Thr Asp 
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245 250 255 


Pro 

Glu 

Lys 

Leu 

Asp 

Gin 

lie 

Arg 

Gin Arg 

Glu 

Ser 

Asp 

lie 

Thr 

Lys 




260 





265 





270 



Glu Arg 

He 

Gin 

Lys 

lie 

Leu 

Ala 

Thr 

Gly Ala 

Asn 

Val 

lie 

Leu 

Thr 



275 





280 





285 




Thr 

Gly 

Gly 

lie 

Asp 

Asp 

Met 

Cys 

Leu 

Lys 

Tyr 

Phe 

Val 

Glu 

Ala 

Gly 


290 





295 





300 





Ala 

Met 

Ala 

Val 

Arg 

Arg 

Val 

Leu 

Lys 

Arg 

Asp 

Leu 

Lys 

Arg 

lie 

Ala 

305 





310 





315 





320 

Lys 

Ala 

Ser 

Gly 

Ala 

Thr 

lie 

Leu 

Ser 

Thr 

Leu 

Ala 

Asn 

Leu 

Glu 

Gly 





325 





330 





335 


Glu 

Glu 

Thr 

Phe 

Glu 

Ala 

Ala 

Met 

Leu 

Gly 

Gin 

Ala 

Glu 

Glu 

Val 

Val 




340 





345 





350 



Gin 

Glu 

Arg 

lie 

Cys 

Asp 

Asp 

Glu 

Leu 

lie 

Leu 

lie 

Lys 

Asn 

Thr 

Lys 



355 





360 





365 




Ala 

Arg 

Thr 

Ser 

Ala 

Ser 

lie 

lie 

Leu 

Arg 

Gly 

Ala 

Asn 

Asp 

Phe 

Met 


370 





375 





380 





Cys 

Asp 

Glu 

Met 

Glu 

Arg 

Ser 

Leu 

His 

Asp 

Ala 

Leu 

Cys 

Val 

Val 

Lys 

385 





390 





395 





400 

Arg 

Val 

Leu 

Glu 

Ser 

Lys 

Ser 

Val 

Val 

Pro 

Gly 

Gly 

Gly 

Ala 

Val 

Glu 





405 





410 





415 


Ala 

Ala 

Leu 

Ser 

lie 

Tyr 

Leu 

Glu 

Asn 

Tyr 

Ala 

Thr 

Ser 

Met 

Gly 

Ser 




420 





425 





430 



Arg 

Glu 

Gin 

Leu 

Ala 

lie 

Ala 

Glu 

Phe 

Ala 

Arg 

Ser 

Leu 

Leu 

Val 

lie 



435 





440 





445 




Pro 

Asn 

Thr 

Leu 

Ala 

Val 

Asn 

Ala 

Ala 

Gin 

Asp 

Ser 

Thr 

Asp 

Leu 

Val 


450 





455 





460 





Ala 

Lys 

Leu 

Arg 

Ala 

Phe 

His 

Asn 

Glu 

Ala 

Gin 

Val 

Asn 

Pro 

Glu 

Arg 

465 





470 





475 





480 

Lys 

Asn 

Leu 

Lys 

Trp 

lie 

Gly 

Leu 

Asp 

Leu 

Ser 

Asn 

Gly 

Lys 

Pro 

Arg 





485 





490 





495 


Asp 

Asn 

Lys 

Gin 

Ala 

Gly 

Val 

Phe 

Glu 

Pro 

Thr 

lie 

Val 

Lys 

Val 

Lys 




500 





505 





510 



Ser 

Leu 

Lys 

Phe 

Ala 

Thr 

Glu 

Ala 

Ala 

lie 

Thr 

lie 

Leu 

Arg 

lie 

Asp 



515 





520 





525 




Asp 

Leu 

lie 

Lys 

Leu 

His 

Pro 

Glu 

Ser 

Lys 

Asp 

Asp 

Lys 

His 

Gly 

Ser 


530 





535 





540 





Tyr 

Glu 

Asp 

Ala 

Val 

His 

Ser 

Gly Ala 

Leu 

Asn 

Asp 





545 





550 





555 







<210 > SEQ ID NO 14 
<211> LENGTH: 556 
<212 > TYPE: PRT 
<213 > ORGANISM: Mus sp 
<220 > FEATURE: 

<221> NAME /KEY: MIS C_FE ATURE 

<223 > OTHER INFORMATION: Mouse TCP1 

<400 > SEQUENCE: 14 

Met Glu Gly Pro Leu Ser Val Phe Gly Asp Arg Ser Thr Gly Glu Ala 
15 10 15 

Val Arg Ser Gin Asn Val Met Ala Ala Ala Ser lie Ala Asn lie Val 
20 25 30 


Lys Ser Ser Phe Gly Pro Val Gly Leu Asp Lys Met Leu Val Asp Asp 
35 40 45 




93 


US 7,795,388 B2 


-continued 


He Gly 
50 

Leu Glu 
65 

Leu Gin 

Ala Ala 

His Pro 

Val Arg 
130 

Arg Asp 
145 

lie Gly 

Leu Ala 

Asn Ser 

Met Leu 
210 

Met Pro 
225 

Leu Gin 

Pro Glu 

Glu Arg 

Thr Gly 
290 

Ala Met 
305 

Lys Ala 

Glu Glu 

Gin Glu 

Ala Arg 
370 

Cys Asp 
385 

Arg Val 

Ala Ala 

Arg Glu 

Pro Asn 
450 

Ala Lys 


Asp Val 

Val Glu 

Asp Lys 

Glu Leu 
100 

Thr Ser 
115 

Tyr lie 

Cys Leu 

lie Asn 

Val Lys 
180 

Val Asn 
195 

lie Asn 

Lys Arg 

Lys Thr 

Lys Leu 
260 

lie Gin 
275 

Gly lie 

Ala Val 

Ser Gly 

Thr Phe 
340 

Arg lie 
355 

Thr Ser 

Glu Met 

Leu Glu 

Leu Ser 
420 

Gin Leu 
435 

Thr Leu 


Leu Arg 


Thr lie 

His Pro 
70 

Glu Val 
85 

Leu Lys 

Val lie 

Asn Glu 

lie Asn 
150 

Gly Asp 
165 

Tyr Thr 

lie Leu 

Gly Tyr 

lie Val 
230 

Lys Met 
245 

Asp Gin 

Lys lie 

Asp Asp 

Arg Arg 
310 

Ala Ser 
325 

Glu Val 

Cys Asp 

Ala Ser 

Glu Arg 
390 

Leu Lys 
405 

lie Tyr 
Ala lie 
Ala Val 
Ala Phe 


Thr Asn 
55 

Ala Ala 

Gly Asp 

Asn Ala 

Ser Gly 
120 

Asn Leu 
135 

Ala Ala 

Tyr Phe 

Asp Ala 

Lys Ala 
200 

Ala Leu 
215 

Asn Ala 

Lys Leu 

lie Arg 

Leu Ala 
280 

Met Tyr 
295 

Val Leu 

lie Leu 

Thr Met 

Asp Glu 
360 

lie lie 
375 

Ser Leu 

Ser Val 

Leu Glu 

Ala Glu 
440 

Asn Ala 
455 

His Asn 


Asp 

Gly 

Lys 

Val 

Gly 

Thr 

90 

Asp 

105 

Glu 

Tyr 

Arg 

lie 

lie 

Lys 

Thr 

Ala 

Asn 

170 

Arg 

185 

Gly 

His 

Gly 

Asn 

Cys 

Lys 

lie 

Gly 

Val 

250 

Gin 

265 

Arg 

Thr 

Gly 

Leu 

Lys 

Lys 

Arg 

Ser 

Thr 

330 

Leu 

345 

Gly 

Leu 

lie 

Leu 

Arg 

His 

Asp 

Val 

Pro 

410 

Asn 

425 

Tyr 

Phe 

Ala 

Ala 

Gin 

Glu 

Ala 


Ala Thr 
60 

Leu Cys 
75 

Thr Ser 


Leu Val 


Leu Ala 


Asn Thr 
140 

Ser Met 
155 

Met Val 

Gin Pro 

Arg Ser 

Val Val 
220 

Ala Cys 
235 

Gin Val 
Glu Ser 


Ala Asn 


Tyr Phe 
300 

Asp Leu 
315 

Leu Ala 

Gin Ala 

Leu lie 

Gly Ala 
380 

Ala Leu 
395 

Gly Gly 

Ala Thr 

Arg Ser 

Asp Ser 
460 

Gin Val 


lie Leu 

Glu Leu 

Val Val 

Lys Gin 
110 

Cys Lys 
125 

Asp Glu 

Ser Ser 

Val Asp 

Arg Tyr 
190 

Gin lie 
205 

Gly Ser 

Leu Asp 

Val lie 

Asp lie 
270 

Val lie 
285 

Val Glu 

Lys Hi s 

Asn Leu 

Glu Glu 
350 

Lys Asn 
365 

Asn Asp 

Cys Val 

Gly Ala 

Ser Met 
430 

Leu Leu 
445 

Thr Asp 
Asn Pro 


Lys Leu 

Ala Asp 
80 

lie lie 
95 

Lys lie 

Glu Ala 

Leu Gly 

Lys lie 
160 

Ala Val 
175 

Pro Val 


Glu Ser 


Gin Gly 


Phe Ser 
240 

Thr Asp 
255 

Thr Lys 

Leu Thr 

Ala Gly 

Val Ala 
320 

Glu Gly 
335 

Val Val 
Thr Lys 


Phe Met 

Val Lys 
400 

Val Glu 
415 

Gly Ser 
Val lie 
Leu Val 
Glu Arg 
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465 





470 





475 





480 

Lys 

Asn 

Leu 

Lys 

Trp 

lie 

Gly 

Leu 

Asp 

Leu 

Val 

His 

Gly 

Lys 

Pro 

Arg 





485 





490 





495 


Asp 

Asn 

Lys 

Gin 

Ala 

Gly 

Val 

Phe 

Glu 

Pro 

Thr 

lie 

Val 

Lys 

Val 

Lys 




500 





505 





510 



Ser 

Leu 

Lys 

Phe 

Ala 

Thr 

Glu 

Ala 

Ala 

lie 

Thr 

lie 

Leu 

Arg 

lie 

Asp 



515 





520 





525 




Asp 

Leu 

He 

Lys 

Leu 

His 

Pro 

Glu 

Ser 

Lys 

Asp 

Asp 

Lys 

His 

Gly 

Ser 


530 





535 





540 





Tyr 

Glu 

Asn 

Ala 

Val 

His 

Ser 

Gly 

Ala 

Leu 

Asp 

Asp 





545 





550 





555 







< 2 10 > SEQ ID NO 15 
< 2 11 > LENGTH: 606 
<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 
<220 > FEATURE: 

<223 > OTHER INFORMATION: alignment consensus sequence 

< 4 00 > SEQUENCE: 15 


Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

1 




5 





10 





15 


Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Pro 

lie 

Glx 

lie 

Glx 

Glu 




20 





25 





30 



Ala 

Glx 

Glx 

Arg 

Glx 

Phe 

Gly 

Glx 

Asp 

Ala 

Arg 

Glx 

Glx 

Asn 

lie 

Glx 



35 





40 





45 




Ala 

Ala 

Glx 

Ala 

Leu 

Ala 

Glu 

Ala 

Val 

Lys 

Ser 

Thr 

Leu 

Gly 

Pro 

Lys 


50 





55 





60 





Gly 

Leu 

Asp 

Lys 

Met 

Leu 

Val 

Asp 

Ser 

Trp 

Gly Asp 

lie 

Thr 

lie 

Thr 

65 





70 





75 





80 

Asn 

Asp 

Gly 

Glx 

Thr 

lie 

Leu 

Lys 

Glu 

lie 

Glu 

Leu 

Glu 

His 

Pro 

Glx 





85 





90 





95 


Glx 

Glx 

Glx 

Gly 

Ala 

Lys 

Leu 

Leu 

Glx 

Glu 

Val 

Ala 

Glx 

Glx 

Gin 

Asp 




100 





105 





110 



Asp 

Glu 

Glx 

Gly 

Asp 

Gly 

Thr 

Thr 

Thr 

Ala 

Val 

Val 

Leu 

Ala 

Glx 

Ala 



115 





120 





125 




Leu 

Leu 

Lys 

Glx 

Ala 

Glx 

Glu 

Leu 

Val 

Glx 

Glx 

Gly 

lie 

His 

Pro 

Thr 


130 





135 





140 





Glx 

Glx 

lie 

Glx 

Gly 

Tyr 

Glx 

Leu 

Ala 

Val 

Glu 

Glx 

Ala 

Val 

Arg 

Glx 

145 





150 





155 





160 

lie 

Glx 

Glx 

Glx 

Ala 

Glx 

Glx 

Glx 

Glx 

Glx 

Val 

Glx 

Glx 

Glx 

Glu 

Glx 





165 





170 





175 


lie 

Glx 

Gin 

Val 

Ala 

Glx 

Thr 

Ser 

Ala 

Glx 

Ser 

Lys 

Glx 

Glx 

Glx 

Gly 




180 





185 





190 



Glx 

Glx 

Glx 

Ala 

Asp 

Ala 

Met 

Glx 

Glx 

Val 

Gly Val 

Glu 

Ala 

Val 

lie 



195 





200 





205 




Thr 

Val 

Glx 

Glu 

Glx 

Lys 

Glx 

Gly 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Val 

Glu 


210 





215 





220 





Glx 

Val 

Lys 

lie 

Asp 

Lys 

Gly 

Tyr 

Gly Glx 

Ser 

Glx 

Glx 

Asp 

Ser 

Glx 

225 





230 





235 





240 

Leu 

lie 

Glx 

Gly 

Glx 

Glx 

Glx 

Glx 

Glx 

Val 

Glx 

Glu 

Glx 

Glx 

Gly 

Met 





245 





250 





255 


Pro 

Lys 

Lys 

lie 

Glx 

Glx 

Glx 

Glx 

Ala 

Lys 

lie 

Glx 

Leu 

Leu 

Asp 

Glx 




260 





265 





270 




Glx Leu Glx Glx Glx Lys Pro Glx Leu Glx lie Glx lie Glx lie Glu 
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275 280 


Glx 

Glx 

290 

Ala 

Leu 

Ser 

Glx 

Leu 

295 

Val 

Lys 

305 

Glu 

Val 

Ala 

Glx 

Lys 

310 

lie 

Glx 

Glx 

Lys 

Gly 

lie Asp 
325 

Asp 

Leu 

Glx 

Glx 

Glx 

Glx 

Glx 

340 

Leu 

Ala 

Leu 

Arg 

Lys 

Leu 

Ala 

355 

Lys 

Ala 

Thr 

Gly 

Ala 

360 

Leu 

Glx 

370 

Gly 

Glu 

Glx 

Glx 

Glx 

375 

Glx 

Glx 

385 

Ala 

Glx 

Glu 

Val 

Glx 

390 

Glx 

Glx 

Glx 

He 

Glx 

Ala 

Glx 

405 

Lys 

Ala 

Glx 

Gly Ala 

Thr 

Glu 

420 

Glx 

Glx 

Val 

Asp 

Ala 

Leu 

Glx 

435 

Val 

Lys 

Ala 

Ala 

Leu 

440 

Gly Gly 
450 

Gly 

Ala 

Leu 

Glu 

Glx 

455 

Leu 

Ala 

465 

Glx 

Thr 

Val 

Glx 

Gly 

470 

Arg 

Glu 

Glx 

Ala 

Leu 

Glu 

Glx 

485 

lie 

Pro 

Glx 

Asp 

Glx 

Glx 

Asp 

500 

lie 

Val 

Glx 

Lys 

Glx 

Glx 

Glx 

515 

Glx 

Glx 

Glx 

Glx 

Glx 

520 

Gly 

Leu 

530 

Asp 

Leu 

Glx 

Glx 

Glx 

535 

Glx 

Gly Val 
545 

lie 

Asp 

Pro 

Glx 

550 

Lys 

Val 

Thr 

Glu 

Ala 

Ala 

Glx 

565 

Leu 

lie 

Leu 

Glx 

Pro 

Glx 

Glx 

580 

Glx 

Glx 

Asp 

Glx 

Glx 

Glx 

Glx 

595 

Met 

Gly 

Glx 

Glx 

Glx 

600 


285 


Leu 

Asn 

Arg 

Glu 

Arg 

Glx 

lie 

Leu 




300 





Gly 

Glx 

Gly 

Ala 

Asn 

Val 

Val 

Glx 



315 





320 

Glx 

Glx 

Glx 

Glx 

Leu 

lie 

Glx 

Glx 


330 





335 


Arg 

Val 

Lys 

Lys 

Glx 

Asp 

Leu 

Glx 

345 





350 



Lys 

lie 

Val 

Thr 

Thr 

lie 

Glx 

Glu 





365 




Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Leu 

Gly 




380 





Lys 

Glx 

Glx 

Glx 

Asp 

Lys 

Leu 

Glx 



395 





400 

Gly 

Val 

Ala 

Ser 

lie 

Leu 

Leu 

Arg 


410 





415 


Glu 

Glx 

Glu 

Arg 

Ser 

Leu 

Glx 

Asp 

425 





430 



Glu 

Glx 

Glu 

Gly 

Glx 

Val 

Val 

Gly 





445 




Ala 

Glx 

Leu 

Leu 

Glx 

Glx 

Glx 

Tyr 




460 





Gin 

Leu 

Ala 

lie 

Glx 

Glx 

Phe 

Ala 



475 





480 

Thr 

Leu 

Ala 

Glx 

Asn 

Ala 

Gly 

Leu 


490 





495 


Leu 

Arg 

Ser 

Glx 

His 

Glx 

Glx 

Glx 

505 





510 



Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 

Glx 





525 




Glx 

Gly 

Glx 

Asp 

Met 

Val 

Glx 

Glx 




540 





Lys 

Arg 

Glx 

Ala 

Leu 

Glx 

Glx 

Ala 



555 





560 

Arg 

lie 

Asp 

Asp 

Val 

Val 

Glx 

Glx 


570 





575 


Glx 

Glx 

Glx 

Ala 

Glx 

Glx 

Glx 

Glx 

585 





590 



Glx 

Glx 

Glx 

Glx 

Glx 

Glx 




605 


<210 > SEQ ID NO 16 

<211 > LENGTH: 12 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - GaAs or InP 
<400 > SEQUENCE: 16 

Val Thr Ser Pro Asp Ser Thr Thr Gly Ala Met Ala 
15 10 


<210 > SEQ ID NO 17 
<211> LENGTH: 12 
<212 > TYPE: PRT 




99 


US 7,795,388 B2 


100 


-continued 


<213 > ORGANISM: Artificial 
<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - GaAs or InP 
< 4 00 > SEQUENCE: 17 

Ala Ala Ser Pro Thr Gin Ser Met Ser Gin Ala Pro 
15 10 


<210 > SEQ ID NO 18 

<211 > LENGTH: 12 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - GaAs or InP 
< 4 00 > SEQUENCE: 18 

Ala Gin Asn Pro Ser Asp Asn Asn Thr His Thr His 
15 10 


< 2 10 > SEQ ID NO 19 
< 2 11 > LENGTH: 12 
<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 
<220 > FEATURE: 

<223> OTHER INFORMATION: metal binding peptide - GaAs or InP 

< 4 00 > SEQUENCE: 19 

Ala Ser Ser Ser Arg Ser His Phe Gly Gin Thr Asp 
15 10 


< 2 10 > SEQ ID NO 20 

< 2 11 > LENGTH: 12 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223> OTHER INFORMATION: metal binding peptide - GaAs or InP 
<400 > SEQUENCE: 20 

Trp Ala His Ala Pro Gin Leu Ala Ser Ser Ser Thr 
15 10 


<210 > SEQ ID NO 21 

< 2 11 > LENGTH: 12 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - GaAs or InP 
<400 > SEQUENCE: 21 

Ala Arg Tyr Asp Leu Ser lie Pro Ser Ser Glu Ser 
15 10 


<210 > SEQ ID NO 22 

<211 > LENGTH: 12 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - GaAs or InP 
<400 > SEQUENCE: 22 

Thr Pro Pro Arg Pro lie Gin Tyr Asn His Thr Ser 
15 10 


<210 > SEQ ID NO 23 

<211> LENGTH: 12 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 
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<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - GaAs or InP 
< 4 00 > SEQUENCE: 23 

Ser Ser Leu Gin Leu Pro Glu Asn Ser Phe Pro His 
15 10 


<210 > SEQ ID NO 24 

<211 > LENGTH: 12 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - GaAs or InP 
< 4 00 > SEQUENCE: 24 

Gly Thr Leu Ala Asn Gin Gin lie Phe Leu Ser Ser 
15 10 


< 2 10 > SEQ ID NO 25 

< 2 11 > LENGTH: 12 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - GaAs or InP 
<400 > SEQUENCE: 25 

His Gly Asn Pro Leu Pro Met Thr Pro Phe Pro Gly 
15 10 


< 2 10 > SEQ ID NO 26 

<211 > LENGTH: 12 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - GaAs or InP 
<400 > SEQUENCE: 26 

Arg Leu Glu Leu Ala lie Pro Leu Gin Gly Ser Gly 
15 10 


< 2 10 > SEQ ID NO 27 

<211 > LENGTH: 9 

<212 > TYPE : PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - ZnS 
<400 > SEQUENCE: 27 

Cys Asn Asn Pro Met His Gin Asn Cys 
1 5 


<210 > SEQ ID NO 28 

<211 > LENGTH: 12 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Ag 
<400 > SEQUENCE: 28 

Ala Tyr Ser Ser Gly Ala Pro Pro Met Pro Pro Phe 
15 10 


<210 > SEQ ID NO 29 

<211> LENGTH: 12 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 
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<223 > OTHER INFORMATION: metal binding peptide - Ag 
<400 > SEQUENCE: 29 

Asn Pro Ser Ser Leu Phe Arg Tyr Leu Pro Ser Asp 
15 10 


<210 > SEQ ID NO 30 

<211 > LENGTH: 12 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Ag 
<400 > SEQUENCE: 30 

Ser Leu Ala Thr Gin Pro Pro Arg Thr Pro Pro Val 
15 10 


< 2 10 > SEQ ID NO 31 
< 2 11 > LENGTH: 14 
<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 
<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Au 

< 4 00 > SEQUENCE: 31 

Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr lie Gin Ser 
15 10 


< 2 10 > SEQ ID NO 32 

<211 > LENGTH: 14 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Au 
< 4 00 > SEQUENCE: 32 

Ala Leu Val Pro Thr Ala His Arg Leu Asp Gly Asn Met His 
15 10 


<210 > SEQ ID NO 33 

<211 > LENGTH: 21 

<212 > TYPE: DNA 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: PCR primer - gamma gene 
< 4 00 > SEQUENCE: 33 

atgaacttag agccttccta t 21 


<210 > SEQ ID NO 34 

<211> LENGTH: 21 

<212 > TYPE: DNA 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: PCR primer - P2 gamma gene 
<400 > SEQUENCE: 34 


ttaactccat aagaaacttg t 


21 


<210 > SEQ ID NO 35 

<211> LENGTH: 40 

<212 > TYPE: DNA 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: PCR primer - PI primer 


<400 > SEQUENCE: 35 
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gaaagaacat atggcctatt tattaagaga aggaacacag 40 

<210 > SEQ ID NO 36 

<211> LENGTH: 43 

<212 > TYPE: DNA 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: PCR primer - P2 primer 
<400 > SEQUENCE: 36 

taaagtactc gagaaaacct aaataaaata atcatatctt aac 43 

< 2 10 > SEQ ID NO 37 
< 2 11 > LENGTH: 1607 
<212 > TYPE: DNA 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

< 22 1 > NAME/KEY: misc_feature 

<223 > OTHER INFORMATION: TF55 gamma subunit 
<400 > SEQUENCE: 37 


atggcctatt 

tattaagaga 

aggaacacag 

agatctactg 

gaaacgaggt 

aatactaaac 

60 

aacatagctg 

tagccaaaat 

attactggaa 

atgctaaagt 

caagcctagg 

tcctaagggt 

120 

ttagacaaga 

tgttagttga 

ggggcaagac 

attacaataa 

ctaatgacgg 

tgcgacaata 

180 

gttaaaaaca 

tggaagtgca 

gcatcctact 

gcaaaattac 

tcattgaaac 

cgctaaaact 

240 

gttgataccg 

aggtaggaga 

tgggacaact 

tcagtagtcg 

ttcttgccgg 

gttactatta 

300 

gaaaaagctg 

aggatttgct 

gaatcagaag 

atccatccaa 

ctgtcataat 

agaaggttat 

360 

aggaaggctc 

taagttcatc 

attagaattg 

ttaaaaagta 

ttgcagataa 

gattagtcca 

420 

gaagatagga 

agatagttca 

cgatctagta 

tatactactc 

tatcgagtaa 

gttcttctca 

480 

acagagcata 

ctctagagaa 

gataataaat 

ctagttattg 

aagcttcatt 

ggcggtattg 

540 

gataaaagag 

atggaaccta 

tgatctggat 

attaagaata 

taaagattgt 

aaaagtcaat 

600 

ggtggggaat 

ttgatgatag 

tgagcttgta 

aatgggatcg 

ttgtagataa 

ggagcccacc 

660 

aatgagaata 

tgccgaaaag 

ggcggaaaac 

gttaaggtaa 

tgttagctga 

cttcccatta 

720 

aaacttgaaa 

aaacggaaat 

tagcatgaag 

ctgggaataa 

gtgaccccac 

tcagataaag 

780 

ggatacttgg 

atgaacaaac 

ggcatatgtt 

aagcaaatgg 

tggataagat 

aaaggctatg 

840 

ggcgttaaat 

tgtttattac 

acaaaaggac 

attgatgaag 

tcgcttcata 

tttaatggga 

900 

aaaagtggga 

taatagcgtt 

aaagaacgta 

aagaggagtg 

acatagagtt 

actgagtaga 

960 

gctactggtg 

cgaaaattgc 

aagtagcatg 

aaagacgcta 

atgagagtga 

tttaggggaa 

1020 

gctaaattag 

tggaggttag 

aaatttagga 

aagaacaaat 

acctcttcat 

tcaatctgat 

1080 

aaagctaaag 

cggtgactgt 

aatcataaag 

ggctcgaata 

acatggtaac 

tgatgaagca 

1140 

gaaaggagtt 

taaatgacgc 

ctttaactcc 

ataagaaact 

tgttactaga 

accctatatt 

1200 

gtggctggtg 

gtggtgctgt 

agaggaggag 

ttggctaaga 

ggttaaggga 

gaacgctgga 

1260 

aaagttcccg 

gaaaggagca 

attggcattt 

aatgcatttg 

cggatgcttt 

ggaggagtac 

1320 

gtttcaatac 

tatcagaaac 

tgctggcatg 

gatcccataa 

gtgcgttaac 

cgaaataaga 

1380 

cataaacatg 

caaacgggtt 

aaagaatgct 

gggattgaca 

tagttaaggc 

tagaatttac 

1440 

gataacatgc 

ttgagcttaa 

agtaatcgat 

tctctaaagg 

ttaaggaaca 

agttttaaag 

1500 

agcgccacag 

aagccgctac 

tgcgatttta 

aagatcgacg 

acatgatagc 

agcagctcct 

1560 

gcaaagcaac 

aacctcaacc 

acaacagcca 

aatccatact 

taggtta 


1607 
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<210 > SEQ ID NO 38 
<211> LENGTH: 535 
<212 > TYPE: PRT 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

<221 > NAME /KEY: MIS C_FEATURE 

<223 > OTHER INFORMATION: TF55 gamma subunit 
<400 > SEQUENCE: 38 


Met 

Ala 

Tyr 

Leu 

Leu 

Arg 

Glu 

Gly 

Thr 

Gin 

Arg 

Ser 

Thr 

Gly 

Asn 

Glu 

1 




5 





10 





15 


Val 

lie 

Leu 

Asn 

Asn 

lie 

Ala 

Val 

Ala 

Lys 

lie 

Leu 

Leu 

Glu 

Met 

Leu 




20 





25 





30 



Lys 

Ser 

Ser 

Leu 

Gly 

Pro 

Lys 

Gly 

Leu 

Asp 

Lys 

Met 

Leu 

Val 

Glu 

Gly 



35 





40 





45 




Gin 

Asp 

lie 

Thr 

lie 

Thr 

Asn 

Asp 

Gly 

Ala 

Thr 

lie 

Val 

Lys 

Asn 

Met 


50 





55 





60 





Glu 

Val 

Gin 

His 

Pro 

Thr 

Ala 

Lys 

Leu 

Leu 

lie 

Glu 

Thr 

Ala 

Lys 

Thr 

65 





70 





75 





80 

Val 

Asp 

Thr 

Glu 

Val 

Gly 

Asp 

Gly 

Thr 

Thr 

Ser 

Val 

Val 

Val 

Leu 

Ala 





85 





90 





95 


Gly 

Leu 

Leu 

Leu 

Glu 

Lys 

Ala 

Glu 

Asp 

Leu 

Leu 

Asn 

Gin 

Lys 

lie 

His 




100 





105 





110 



Pro 

Thr 

Val 

lie 

lie 

Glu 

Gly 

Tyr 

Arg 

Lys 

Ala 

Leu 

Ser 

Ser 

Ser 

Leu 



115 





120 





125 




Glu 

Leu 

Leu 

Lys 

Ser 

lie 

Ala 

Asp 

Lys 

lie 

Ser 

Pro 

Glu 

Asp 

Arg 

Lys 


130 





135 





140 





He 

Val 

His 

Asp 

Leu 

Val 

Tyr 

Thr 

Thr 

Leu 

Ser 

Ser 

Lys 

Phe 

Phe 

Ser 

145 





150 





155 





160 

Thr 

Glu 

His 

Thr 

Leu 

Glu 

Lys 

lie 

lie 

Asn 

Leu 

Val 

lie 

Glu 

Ala 

Ser 





165 





170 





175 


Leu 

Ala 

Val 

Leu 

Asp 

Lys 

Arg 

Asp 

Gly 

Thr 

Tyr 

Asp 

Leu 

Asp 

lie 

Lys 




180 





185 





190 



Asn 

lie 

Lys 

lie 

Val 

Lys 

Val 

Asn 

Gly 

Gly 

Glu 

Phe 

Asp 

Asp 

Ser 

Glu 



195 





200 





205 




Leu 

Val 

Asn 

Gly 

lie 

Val 

Val 

Asp 

Lys 

Glu 

Pro 

Thr 

Asn 

Glu 

Asn 

Met 


210 





215 





220 





Pro 

Lys 

Arg 

Ala 

Glu 

Asn 

Val 

Lys 

Val 

Met 

Leu 

Ala 

Asp 

Phe 

Pro 

Leu 

225 





230 





235 





240 

Lys 

Leu 

Glu 

Lys 

Thr 

Glu 

lie 

Ser 

Met 

Lys 

Leu 

Gly 

lie 

Ser 

Asp 

Pro 





245 





250 





255 


Thr 

Gin 

lie 

Lys 

Gly 

Tyr 

Leu 

Asp 

Glu 

Gin 

Thr 

Ala 

Tyr 

Val 

Lys 

Gin 




260 





265 





270 



Met 

Val 

Asp 

Lys 

lie 

Lys 

Ala 

Met 

Gly 

Val 

Lys 

Leu 

Phe 

lie 

Thr 

Gin 



275 





280 





285 




Lys 

Asp 

lie 

Asp 

Glu 

Val 

Ala 

Ser 

Tyr 

Leu 

Met 

Gly 

Lys 

Ser 

Gly 

lie 


290 





295 





300 





lie 

Ala 

Leu 

Lys 

Asn 

Val 

Lys 

Arg 

Ser 

Asp 

lie 

Glu 

Leu 

Leu 

Ser 

Arg 

305 





310 





315 





320 

Ala 

Thr 

Gly 

Ala 

Lys 

lie 

Ala 

Ser 

Ser 

Met 

Lys 

Asp 

Ala 

Asn 

Glu 

Ser 





325 





330 





335 


Asp 

Leu 

Gly 

Glu 

Ala 

Lys 

Leu 

Val 

Glu 

Val 

Arg 

Asn 

Leu 

Gly 

Lys 

Asn 




340 





345 





350 




Lys Tyr Leu Phe lie Gin Ser Asp Lys Ala Lys Ala Val Thr Val lie 
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355 





360 





365 




He 

Lys 

Gly 

Ser 

Asn 

Asn 

Met 

Val 

Thr 

Asp 

Glu 

Ala 

Glu 

Arg 

Ser 

Leu 


370 





375 





380 





Asn Asp 

Ala 

Phe 

Asn 

Ser 

lie 

Arg 

Asn 

Leu 

Leu 

Leu 

Glu 

Pro 

Tyr 

lie 

385 





390 





395 





400 

Val 

Ala 

Gly 

Gly 

Gly 

Ala 

Val 

Glu 

Glu 

Glu 

Leu 

Ala 

Lys 

Arg 

Leu 

Arg 





405 





410 





415 


Glu 

Asn 

Ala 

Gly 

Lys 

Val 

Pro 

Gly 

Lys 

Glu 

Gin 

Leu 

Ala 

Phe 

Asn 

Ala 




420 





425 





430 



Phe 

Ala 

Asp 

Ala 

Leu 

Glu 

Glu 

Tyr 

Val 

Ser 

lie 

Leu 

Ser 

Glu 

Thr 

Ala 



435 





440 





445 




Gly 

Met 

Asp 

Pro 

lie 

Ser 

Ala 

Leu 

Thr 

Glu 

lie 

Arg 

His 

Lys 

His 

Ala 


450 





455 





460 





Asn 

Gly 

Leu 

Lys 

Asn 

Ala 

Gly 

lie 

Asp 

lie 

Val 

Lys 

Ala 

Arg 

lie 

Tyr 

465 





470 





475 





480 

Asp 

Asn 

Met 

Leu 

Glu 

Leu 

Lys 

Val 

lie 

Asp 

Ser 

Leu 

Lys 

Val 

Lys 

Glu 





485 





490 





495 


Gin 

Val 

Leu 

Lys 

Ser 

Ala 

Thr 

Glu 

Ala 

Ala 

Thr 

Ala 

lie 

Leu 

Lys 

lie 




500 





505 





510 



Asp 

Asp 

Met 

lie 

Ala 

Ala 

Ala 

Pro 

Ala 

Lys 

Gin 

Gin 

Pro 

Gin 

Pro 

Gin 



515 





520 





525 




Gin 

Pro 

Asn 

Pro 

Tyr 

Leu 

Gly 











530 





535 










< 2 10 > SEQ ID NO 

39 












< 2 11 > LENGTH: 560 












<212 > TYPE: 

PRT 













<213 > ORGANISM: 

Sulfolobus shibatae 








<220 > FEATURE: 













< 22 1 > NAME/KEY: 

MIS C_FE ATURE 









<223 > OTHER 

INFORMATION 

: TF55 alpha 

subunit 






< 4 00 > SEQUENCE: 

39 












Met 

Ala 

Ser 

Pro 

Val 

Leu 

Leu 

Leu 

Lys 

Glu 

Gly 

Thr 

Ser 

Arg 

Thr 

Thr 

1 




5 





10 





15 


Gly Arg 

Asp 

Ala 

Leu 

Arg 

Asn 

Asn 

lie 

Leu 

Ala 

Ala 

Lys 

Thr 

Leu 

Ala 




20 





25 





30 



Glu 

Met 

Leu 

Arg 

Ser 

Ser 

Leu 

Gly 

Pro 

Lys 

Gly 

Leu 

Asp 

Lys 

Met 

Leu 



35 





40 





45 




lie 

Asp 

Ser 

Phe 

Gly 

Asp 

Val 

Thr 

lie 

Thr 

Asn 

Asp 

Gly 

Ala 

Thr 

lie 


50 





55 





60 





Val 

Lys 

Asp 

Met 

Glu 

lie 

Gin 

His 

Pro 

Ala 

Ala 

Lys 

Leu 

Leu 

Val 

Glu 

65 





70 





75 





80 

Ala 

Ala 

Lys 

Ala 

Gin 

Asp 

Ala 

Glu 

Val 

Gly 

Asp 

Gly 

Thr 

Thr 

Ser 

Ala 





85 





90 





95 


Val 

Val 

Leu 

Ala 

Gly 

Ala 

Leu 

Leu 

Glu 

Lys 

Ala 

Glu 

Ser 

Leu 

Leu 

Asp 




100 





105 





110 



Gin 

Asn 

lie 

His 

Pro 

Thr 

lie 

lie 

lie 

Glu 

Gly 

Tyr 

Lys 

Lys 

Ala 

Tyr 



115 





120 





125 




Thr 

Lys 

Ala 

Leu 

Glu 

Leu 

Leu 

Pro 

Gin 

Leu 

Gly 

Thr 

Arg 

lie 

Asp 

lie 


13 0 





135 





140 





Arg Asp 

Leu 

Asn 

Ser 

Ser 

Val 

Ala 

Arg Asp 

Thr 

Leu 

Arg 

Lys 

lie 

Ala 

145 





150 





155 





160 

Phe 

Thr 

Thr 

Leu 

Ala 

Ser 

Lys 

Phe 

lie 

Ala 

Glu Gly 

Ala 

Glu 

Leu 

Asn 





165 





170 





175 
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Lys 

lie 

lie 

Asp 

Met 

Val 

lie 

Asp 

Ala 

lie 

Val 

Asn 

Val 

Ala 

Glu 

Pro 




180 





185 





190 



Leu 

Pro 

Asn 

Gly Gly 

Tyr 

Asn 

Val 

Ser 

Leu 

Asp 

Leu 

lie 

Lys 

lie 

Asp 



195 





200 





205 




Lys 

Lys 

Lys 

Gly 

Gly 

Ser 

lie 

Glu 

Asp 

Ser 

Val 

Leu 

Val 

Lys 

Gly 

Leu 


210 





215 





220 





Val 

Leu 

Asp 

Lys 

Glu 

Val 

Val 

His 

Pro 

Gly 

Met 

Pro 

Arg 

Arg 

Val 

Thr 

225 





230 





235 





240 

Lys 

Ala 

Lys 

lie 

Ala 

Val 

Leu 

Asp 

Ala 

Ala 

Leu 

Glu 

Val 

Glu 

Lys 

Pro 





245 





250 





255 


Glu 

lie 

Ser 

Ala 

Lys 

lie 

Ser 

lie 

Thr 

Ser 

Pro 

Glu 

Gin 

lie 

Lys 

Ala 




260 





265 





270 



Phe 

Leu 

Asp 

Glu 

Glu 

Ser 

Lys 

Tyr 

Leu 

Lys 

Asp 

Met 

Val 

Asp 

Lys 

Leu 



275 





280 





285 




Ala 

Ser 

lie 

Gly Ala 

Asn 

Val 

Val 

lie 

Cys 

Gin 

Lys 

Gly 

lie 

Asp 

Asp 


290 





295 





300 





He 

Ala 

Gin 

His 

Phe 

Leu 

Ala 

Lys 

Lys 

Gly 

lie 

Leu 

Ala 

Val 

Arg 

Arg 

305 





310 





315 





320 

Val 

Lys 

Arg 

Ser 

Asp 

lie 

Glu 

Lys 

Leu 

Glu 

Lys 

Ala 

Leu 

Gly 

Ala 

Arg 





325 





330 





335 


lie 

lie 

Ser 

Ser 

lie 

Lys 

Asp 

Ala 

Thr 

Pro 

Asp 

Asp 

Leu 

Gly 

Tyr 

Ala 




340 





345 





350 



Glu 

Leu 

Val 

Glu 

Glu 

Arg 

Arg 

Val 

Gly 

Asn 

Asp 

Lys 

Met 

Val 

Phe 

lie 



355 





360 





365 




Glu 

Gly 

Ala 

Lys 

Asn 

Leu 

Lys 

Ala 

Val 

Asn 

lie 

Leu 

Leu 

Arg 

Gly 

Ser 


370 





375 





380 





Asn 

Asp 

Met 

Ala 

Leu 

Asp 

Glu 

Ala 

Glu 

Arg 

Ser 

lie 

Asn 

Asp 

Ala 

Leu 

385 





390 





395 





400 

His 

Ala 

Leu 

Arg 

Asn 

lie 

Leu 

Leu 

Glu 

Pro 

Val 

lie 

Leu 

Pro 

Gly 

Gly 





405 





410 





415 


Gly 

Ala 

lie 

Glu 

Leu 

Glu 

Leu 

Ala 

Met 

Lys 

Leu 

Arg 

Glu 

Tyr 

Ala 

Arg 




420 





425 





430 



Ser 

Val 

Gly 

Gly 

Lys 

Glu 

Gin 

Leu 

Ala 

lie 

Glu 

Ala 

Phe 

Ala 

Asp 

Ala 



435 





440 





445 




Leu 

Glu 

Glu 

lie 

Pro 

Thr 

lie 

Leu 

Ala 

Glu 

Thr 

Ala 

Gly 

Leu 

Glu 

Ala 


450 





455 





460 





lie 

Ser 

Ala 

Leu 

Met 

Asp 

Leu 

Arg 

Ala 

Arg 

His 

Ala 

Lys 

Gly 

Leu 

Thr 

465 





470 





475 





480 

Asn 

Thr 

Gly 

Val 

Asp 

Val 

lie 

Gly 

Gly 

Lys 

lie 

Val 

Asp 

Asp 

Val 

Tyr 





485 





490 





495 


Ala 

Leu 

Asn 

lie 

lie 

Glu 

Pro 

lie 

Arg 

Val 

Lys 

Ala 

Gin 

Val 

Leu 

Lys 




500 





505 





510 



Ser 

Ala 

Thr 

Glu 

Ala 

Ala 

Thr 

Ala 

lie 

Leu 

Lys 

lie 

Asp 

Asp 

Leu 

lie 



515 





520 





525 




Ala 

Ala 

Ala 

Pro 

Leu 

Lys 

Ser 

Glu 

Lys 

Lys 

Gly 

Gly 

Glu 

Gly 

Ser 

Lys 


530 





535 





540 





Glu 

Glu 

Ser 

Gly Gly 

Glu 

Gly 

Gly 

Ala 

Gly 

Thr 

Pro 

Ser 

Leu 

Gly Asp 

545 





550 





555 





560 

<210 

i> SEQ ID NO 

40 












<211 

.> LENGTH: 21 












<212 

; > TYPE: 

PRT 













<213 

> ORGANISM: 

Artificial 










<220 

i> FEATURE: 













<223 

> OTHER 

INFORMATION: 

: metal binding peptide ■ 

- Au 
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114 


-continued 


<400 > SEQUENCE: 40 

Ser Lys Thr Ser Leu Gly Gin Ser Gly Ala Ser Leu Gin Gly Ser Glu 
15 10 15 

Lys Leu Thr Asn Gly 
20 


<210 > SEQ ID NO 41 

<211 > LENGTH: 21 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Au 
< 4 00 > SEQUENCE: 41 

Gin Ala Thr Ser Glu Lys Leu Val Arg Gly Met Glu Gly Ala Ser Leu 
15 10 15 

His Pro Ala Lys Thr 
20 


< 2 10 > SEQ ID NO 42 

< 2 11 > LENGTH: 7 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Pt 

< 4 00 > SEQUENCE: 42 

Asp Arg Thr Ser Thr Trp Arg 
1 5 


< 2 10 > SEQ ID NO 43 

< 2 11 > LENGTH: 7 

<212 > TYPE : PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Pt 

< 4 00 > SEQUENCE: 43 

Gin Ser Val Thr Ser Thr Lys 
1 5 


< 2 10 > SEQ ID NO 44 

<211 > LENGTH: 7 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Pt 

< 4 00 > SEQUENCE: 44 

Ser Ser Ser His Leu Asn Lys 
1 5 


<210 > SEQ ID NO 45 

<211 > LENGTH: 7 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Pd 

<400 > SEQUENCE: 45 

Ser Val Thr Gin Asn Lys Tyr 
1 5 


<210 > SEQ ID NO 46 
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<211> LENGTH: 7 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Pd 

<400 > SEQUENCE: 46 

Ser Pro His Pro Gly Pro Tyr 
1 5 


<210 > SEQ ID NO 47 

<211 > LENGTH: 7 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Pd 

< 4 00 > SEQUENCE: 47 

His Ala Pro Thr Pro Met Leu 
1 5 


< 2 10 > SEQ ID NO 48 
< 2 11 > LENGTH: 12 
<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 
<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Si02 

< 4 00 > SEQUENCE: 48 

Met Ser Pro His Pro His Pro Arg His His His Thr 
15 10 


< 2 10 > SEQ ID NO 49 
<211 > LENGTH: 12 
<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 
<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Si02 

< 4 00 > SEQUENCE: 49 

Arg Gly Arg Arg Arg Arg Leu Ser Cys Arg Leu Leu 
15 10 


< 2 10 > SEQ ID NO 50 

<211 > LENGTH: 12 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Si02 
< 4 00 > SEQUENCE: 50 

Lys Pro Ser His His His His His Thr Gly Ala Asn 
15 10 


<210 > SEQ ID NO 51 

<211> LENGTH: 21 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - zeolites 
<400 > SEQUENCE: 51 

Val Lys Thr Gin Ala Thr Ser Arg Glu Glu Pro Pro Arg Leu Pro Ser 
15 10 15 

Lys His Arg Pro Gly 
20 
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<210 > SEQ ID NO 52 

<211> LENGTH: 14 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - zeolites 
< 4 00 > SEQUENCE: 52 

Met Asp His Gly Lys Tyr Arg Gin Lys Gin Ala Thr Pro Gly 
15 10 


<210 > SEQ ID NO 53 

< 2 11 > LENGTH: 20 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - ZnO 
< 4 00 > SEQUENCE: 53 

Asn Thr Arg Met Thr Ala Arg Gin His Arg Ser Ala Asn His Lys Ser 
15 10 15 

Thr Gin Arg Ala 
20 


< 2 10 > SEQ ID NO 54 
< 2 11 > LENGTH: 9 
<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 
<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide ZnO 

< 4 00 > SEQUENCE: 54 

Tyr Asp Ser Arg Ser Met Arg Pro His 
1 5 


<210 > SEQ ID NO 55 

<211> LENGTH: 12 

<212 > TYPE : PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - CaC03 
< 4 00 > SEQUENCE: 55 

His Thr Gin Asn Met Arg Met Tyr Glu Pro Trp Phe 
15 10 


<210 > SEQ ID NO 56 

<211 > LENGTH: 12 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - CaC03 
< 4 00 > SEQUENCE: 56 

Asp Val Phe Ser Ser Phe Asn Leu Lys His Met Arg 
15 10 


<210 > SEQ ID NO 57 

<211> LENGTH: 8 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Cr203 
<400 > SEQUENCE: 57 


Trp Arg Pro Lys Ala Ala Thr Asn 
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1 5 


<210 > SEQ ID NO 58 

<211 > LENGTH: 9 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Cr203 
< 4 00 > SEQUENCE: 58 

Arg lie Arg His Arg Leu Val Gly Gin 
1 5 


< 2 10 > SEQ ID NO 59 

< 2 11 > LENGTH: 9 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Fe203 
< 4 00 > SEQUENCE: 59 

Arg Arg Thr Val Lys His His Val Asn 
1 5 


< 2 10 > SEQ ID NO 60 
< 2 11 > LENGTH: 11 
<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 
<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - GaAs 

< 4 00 > SEQUENCE: 60 

Ala Gin Asn Pro Ser Asp Asn Asn Thr Thr His 
15 10 


< 2 10 > SEQ ID NO 61 

<211 > LENGTH: 12 

<212 > TYPE : PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - GaAs 
<400 > SEQUENCE: 61 

Arg Leu Glu Leu Ala lie Pro Leu Gin Gly Ser Gly 
15 10 


< 2 10 > SEQ ID NO 62 

<211 > LENGTH: 12 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - GaAs 
<400 > SEQUENCE: 62 

Thr Pro Pro Arg Pro lie Gin Tyr Asn His Thr Ser 
15 10 


<210 > SEQ ID NO 63 

<211 > LENGTH: 7 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - ZnS 
<400 > SEQUENCE: 63 


Asn Asn Pro Met His Gin Asn 
1 5 
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-continued 


<210 > SEQ ID NO 64 

<211 > LENGTH: 6 

<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: flexible peptide linker 

< 4 00 > SEQUENCE: 64 

Gly Gly Ser Gly Gly Thr 
1 5 


< 2 10 > SEQ ID NO 65 

< 2 11 > LENGTH: 18 

<212 > TYPE: DNA 

<213 > ORGANISM: Artificial 

<220 > FEATURE: 

<223 > OTHER INFORMATION: flexible linker 
<400 > SEQUENCE: 65 

ggtggttctg gtggtacc 18 


< 2 10 > SEQ ID NO 66 
< 2 11 > LENGTH: 1575 
<212 > TYPE: DNA 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

< 22 1 > NAME/KEY: misc_feature 

<223 > OTHER INFORMATION: chaperonin perimitant 153 
< 4 00 > SEQUENCE: 66 


atggcaataa 

atgatactga 

cgtacttagg 

aaagtagcat 

taacatcctt 

aggcagtaag 

60 

gcagtagcag 

gcgcacgaga 

gtatttagct 

gaccttgtgg 

ttaaagcagt 

ggcacaagta 

120 

gcagaattaa 

gaggagataa 

gtggtatgtt 

gatctagata 

atgtacaaat 

agttaaaaaa 

180 

catggtggta 

gcattaatga 

tactcaatta 

gtatacggca 

tagtagttga 

taaggaagtt 

240 

gtacatccgg 

gcatgccaaa 

gaggattgaa 

aatgctaaga 

tagccctttt 

agacgcttca 

300 

ttagaagttg 

agaaacccga 

attggatgca 

gaaataagaa 

ttaacgatcc 

aacacagatg 

360 

cacaaattct 

tggaagaaga 

agaaaacata 

ttgaaagaaa 

aagtagataa 

gattgcagct 

420 

actggtgcta 

acgttgtaat 

agcgcagaaa 

ggtatcgatg 

aagttgcaca 

acactattta 

480 

gctaagaaag 

gtatattagc 

tgttaggaga 

gccaagaaga 

gtgatttaga 

gaaattagct 

540 

agagctaccg 

gaggtagagt 

catatcaaat 

attgatgaat 

taacttcaca 

agatctaggt 

600 

tatgccgcat 

tagtggaaga 

gagaaaagta 

ggtgaagaca 

agatcgtatt 

cgtagaaggt 

660 

gcaaagaatc 

caaaatcagt 

tagtatacta 

ataagaggag 

gattagagag 

agtagtagat 

720 

gagactgaaa 

gagctcttag 

ggacgcttta 

ggtacagtgg 

cagatgtaat 

aagggatggt 

780 

agagcagtag 

ctggtggtgg 

agctgttgag 

atagagatag 

ctaagagatt 

aagaaagtat 

840 

gccccacaag 

ttggtggtaa 

agagcaatta 

gcaattgaag 

catatgctaa 

tgcaatagag 

900 

ggtctcatta 

tgatattggc 

ggaaaacgca 

ggattagatc 

ctatagacaa 

attaatgcaa 

960 

ttaagaagtc 

ttcacgagaa 

tgagaccaat 

aaatggtatg 

gacttaattt 

atttactgga 

1020 

aatccagagg 

atatgtggaa 

attaggtgtt 

attgaaccgg 

cactagtgaa 

aatgaatgca 

1080 

attaaggctg 

caacagaagc 

agtaacatta 

gtgttaagaa 

tagatgatat 

tgtaggtggt 

1140 

tctggtggta 

ccatacctgt 

aataatttta 

aaagagggat 

caagtagaac 

atatggaaaa 

1200 

gaagctttaa 

gggctaatat 

tgctgcagtg 

aaagcaattg 

aagaggcatt 

aaaaagcacc 

1260 
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-continued 


tatggtccac 

gtggaatgga 

taagattctt 

gttgatagct 

taggagatat 

tacaataaca 

1320 

aatgatggag 

ccactattct 

tgataaaatg 

gatttacaac 

acccaacagg 

taagctttta 

1380 

gttcagatag 

ctaaaggaca 

agacgaggaa 

acagctgatg 

gcactaaaac 

tgctgtaatt 

1440 

cttgctggag 

aattagctaa 

aaaagcagaa 

gatcttttat 

ataaggagat 

tcacccaaca 

1500 

ataattgtaa 

gcggatataa 

gaaggcagaa 

gaaattgcat 

taaagaccat 

ccaagatata 

1560 

gcacaaccgg 

tcagc 





1575 


<210 > SEQ ID NO 67 
<211 > LENGTH: 525 
<212 > TYPE: PRT 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

<221 > NAME /KEY: MIS C_FEATURE 

<223 > OTHER INFORMATION: chaperonin permutant 153 
<400 > SEQUENCE: 67 


Met 

Ala 

lie 

Asn 

Asp 

Thr 

Asp 

Val 

Leu 

Arg 

Lys 

Val 

Ala 

Leu 

Thr 

Ser 

1 




5 





10 





15 


Leu 

Gly 

Ser 

Lys 

Ala 

Val 

Ala 

Gly 

Ala 

Arg 

Glu 

Tyr 

Leu 

Ala 

Asp 

Leu 




20 





25 





30 



Val 

Val 

Lys 

Ala 

Val 

Ala 

Gin 

Val 

Ala 

Glu 

Leu 

Arg 

Gly 

Asp 

Lys 

Trp 



35 





40 





45 




Tyr 

Val 

Asp 

Leu 

Asp 

Asn 

Val 

Gin 

lie 

Val 

Lys 

Lys 

His 

Gly 

Gly 

Ser 


50 





55 





60 





He 

Asn 

Asp 

Thr 

Gin 

Leu 

Val 

Tyr 

Gly 

lie 

Val 

Val 

Asp 

Lys 

Glu 

Val 

65 





70 





75 





80 

Val 

His 

Pro 

Gly 

Met 

Pro 

Lys 

Arg 

lie 

Glu 

Asn 

Ala 

Lys 

lie 

Ala 

Leu 





85 





90 





95 


Leu 

Asp 

Ala 

Ser 

Leu 

Glu 

Val 

Glu 

Lys 

Pro 

Glu 

Leu 

Asp 

Ala 

Glu 

lie 




100 





105 





110 



Arg 

lie 

Asn 

Asp 

Pro 

Thr 

Gin 

Met 

His 

Lys 

Phe 

Leu 

Glu 

Glu 

Glu 

Glu 



115 





120 





125 




Asn 

lie 

Leu 

Lys 

Glu 

Lys 

Val 

Asp 

Lys 

lie 

Ala 

Ala 

Thr 

Gly 

Ala 

Asn 


130 





135 





140 





Val 

Val 

lie 

Ala 

Gin 

Lys 

Gly 

lie 

Asp 

Glu 

Val 

Ala 

Gin 

His 

Tyr 

Leu 

145 





150 





155 





160 

Ala 

Lys 

Lys 

Gly 

lie 

Leu 

Ala 

Val 

Arg 

Arg 

Ala 

Lys 

Lys 

Ser 

Asp 

Leu 





165 





170 





175 


Glu 

Lys 

Leu 

Ala 

Arg 

Ala 

Thr 

Gly 

Gly 

Arg 

Val 

lie 

Ser 

Asn 

lie 

Asp 




180 





185 





190 



Glu 

Leu 

Thr 

Ser 

Gin 

Asp 

Leu 

Gly 

Tyr 

Ala 

Ala 

Leu 

Val 

Glu 

Glu 

Arg 



195 





200 





205 




Lys 

Val 

Gly 

Glu 

Asp 

Lys 

lie 

Val 

Phe 

Val 

Glu 

Gly 

Ala 

Lys 

Asn 

Pro 


210 





215 





220 





Lys 

Ser 

Val 

Ser 

lie 

Leu 

lie 

Arg 

Gly 

Gly 

Leu 

Glu 

Arg 

Val 

Val 

Asp 

225 





230 





235 





240 

Glu 

Thr 

Glu 

Arg 

Ala 

Leu 

Arg 

Asp 

Ala 

Leu 

Gly 

Thr 

Val 

Ala 

Asp 

Val 





245 





250 





255 


lie 

Arg 

Asp 

Gly 

Arg 

Ala 

Val 

Ala 

Gly 

Gly 

Gly 

Ala 

Val 

Glu 

lie 

Glu 




260 





265 





270 



lie 

Ala 

Lys 

Arg 

Leu 

Arg 

Lys 

Tyr 

Ala 

Pro 

Gin 

Val 

Gly 

Gly 

Lys 

Glu 



275 





280 





285 





Gin Leu Ala lie Glu Ala Tyr Ala Asn Ala lie Glu Gly Leu lie Met 
290 295 300 
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-continued 


He 

Leu 

Ala 

Glu 

Asn 

Ala 

Gly 

Leu 

Asp 

Pro 

lie 

Asp 

Lys 

Leu 

Met 

Gin 

305 





310 





315 





320 

Leu Arg 

Ser 

Leu 

His 

Glu 

Asn 

Glu 

Thr 

Asn 

Lys 

Trp 

Tyr 

Gly 

Leu 

Asn 





325 





330 





335 


Leu 

Phe 

Thr 

Gly 

Asn 

Pro 

Glu 

Asp 

Met 

Trp 

Lys 

Leu 

Gly 

Val 

lie 

Glu 




340 





345 





350 



Pro 

Ala 

Leu 

Val 

Lys 

Met 

Asn 

Ala 

lie 

Lys 

Ala 

Ala 

Thr 

Glu 

Ala 

Val 



355 





360 





365 




Thr 

Leu 

Val 

Leu 

Arg 

lie 

Asp 

Asp 

lie 

Val 

Gly 

Gly 

Ser 

Gly 

Gly 

Thr 


370 





375 





380 





lie 

Pro 

Val 

lie 

lie 

Leu 

Lys 

Glu 

Gly 

Ser 

Ser 

Arg 

Thr 

Tyr 

Gly 

Lys 

385 





390 





395 





400 

Glu 

Ala 

Leu 

Arg 

Ala 

Asn 

lie 

Ala 

Ala 

Val 

Lys 

Ala 

lie 

Glu 

Glu 

Ala 





405 





410 





415 


Leu 

Lys 

Ser 

Thr 

Tyr 

Gly 

Pro 

Arg 

Gly 

Met 

Asp 

Lys 

lie 

Leu 

Val 

Asp 




420 





425 





430 



Ser 

Leu 

Gly 

Asp 

lie 

Thr 

lie 

Thr 

Asn 

Asp 

Gly 

Ala 

Thr 

lie 

Leu 

Asp 



435 





440 





445 




Lys 

Met 

Asp 

Leu 

Gin 

His 

Pro 

Thr 

Gly 

Lys 

Leu 

Leu 

Val 

Gin 

lie 

Ala 


450 





455 





460 





Lys 

Gly 

Gin 

Asp 

Glu 

Glu 

Thr 

Ala 

Asp 

Gly 

Thr 

Lys 

Thr 

Ala 

Val 

lie 

465 





470 





475 





480 

Leu 

Ala 

Gly 

Glu 

Leu 

Ala 

Lys 

Lys 

Ala 

Glu 

Asp 

Leu 

Leu 

Tyr 

Lys 

Glu 





485 





490 





495 


lie 

His 

Pro 

Thr 

lie 

lie 

Val 

Ser 

Gly 

Tyr 

Lys 

Lys 

Ala 

Glu 

Glu 

lie 




500 





505 





510 



Ala 

Leu 

Lys 

Thr 

lie 

Gin 

Asp 

lie 

Ala 

Gin 

Pro 

Val 

Ser 






515 





520 





525 





<210 > SEQ ID NO 68 
<211> LENGTH: 1575 
<212 > TYPE: DNA 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

<221> NAME/KEY: misc_feature 

<223 > OTHER INFORMATION: chaperonin permutant 267 
<400 > SEQUENCE: 68 


atggcagatc 

caacacagat 

gcacaaattc 

ttggaagaag 

aagaaaacat 

attgaaagaa 

60 

aaagtagata 

agattgcagc 

tactggtgct 

aacgttgtaa 

tagcgcagaa 

aggtatcgat 

120 

gaagttgcac 

aacactattt 

agctaagaaa 

ggtatattag 

ctgttaggag 

agccaagaag 

180 

agtgatttag 

agaaattagc 

tagagctacc 

ggaggtagag 

tcatatcaaa 

tattgatgaa 

240 

ttaacttcac 

aagatctagg 

ttatgccgca 

ttagtggaag 

agagaaaagt 

aggtgaagac 

300 

aagatcgtat 

tcgtagaagg 

tgcaaagaat 

ccaaaatcag 

ttagtatact 

aataagagga 

360 

ggattagaga 

gagtagtaga 

tgagactgaa 

agagctctta 

gggacgcttt 

aggtacagtg 

420 

gcagatgtaa 

taagggatgg 

tagagcagta 

gctggtggtg 

gagctgttga 

gatagagata 

480 

gctaagagat 

taagaaagta 

tgccccacaa 

gttggtggta 

aagagcaatt 

agcaattgaa 

540 

gcatatgcta 

atgcaataga 

gggtctcatt 

atgatattgg 

cggaaaacgc 

aggattagat 

600 

cctatagaca 

aattaatgca 

attaagaagt 

cttcacgaga 

atgagaccaa 

taaatggtat 

660 

ggacttaatt 

tatttactgg 

aaatccagag 

gatatgtgga 

aattaggtgt 

tattgaaccg 

720 

gcactagtga 

aaatgaatgc 

aattaaggct 

gcaacagaag 

cagtaacatt 

agtgttaaga 

780 
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-continued 


atagatgata 

ttgtaggtgg 

ttctggtggt 

accatacctg 

taataatttt 

aaaagaggga 

840 

tcaagtagaa 

catatggaaa 

agaagcttta 

agggctaata 

ttgctgcagt 

gaaagcaatt 

900 

gaagaggcat 

taaaaagcac 

ctatggtcca 

cgtggaatgg 

ataagattct 

tgttgatagc 

960 

ttaggagata 

ttacaataac 

aaatgatgga 

gccactattc 

ttgataaaat 

ggatttacaa 

1020 

cacccaacag 

gtaagctttt 

agttcagata 

gctaaaggac 

aagacgagga 

aacagctgat 

1080 

ggcactaaaa 

ctgctgtaat 

tcttgctgga 

gaattagcta 

aaaaagcaga 

agatctttta 

1140 

tataaggaga 

ttcacccaac 

aataattgta 

agcggatata 

agaaggcaga 

agaaattgca 

1200 

ttaaagacca 

tccaagatat 

agcacaaccg 

gtcagcataa 

atgatactga 

cgtacttagg 

1260 

aaagtagcat 

taacatcctt 

aggcagtaag 

gcagtagcag 

gcgcacgaga 

gtatttagct 

1320 

gaccttgtgg 

ttaaagcagt 

ggcacaagta 

gcagaattaa 

gaggagataa 

gtggtatgtt 

1380 

gatctagata 

atgtacaaat 

agttaaaaaa 

catggtggta 

gcattaatga 

tactcaatta 

1440 

gtatacggca 

tagtagttga 

taaggaagtt 

gtacatccgg 

gcatgccaaa 

gaggattgaa 

1500 

aatgctaaga 

tagccctttt 

agacgcttca 

ttagaagttg 

agaaacccga 

attggatgca 

1560 

gaaataagaa 

ttaac 





1575 


< 2 10 > SEQ ID NO 69 
< 2 11 > LENGTH: 525 
<212 > TYPE: PRT 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

<221 > NAME /KEY: MIS C_FE ATURE 

<223 > OTHER INFORMATION: chaperonin permutant 267 


< 4 00 > SEQUENCE: 69 


Met 

Ala 

Asp 

Pro 

Thr 

Gin 

Met 

His 

Lys 

Phe 

Leu 

Glu 

Glu 

Glu 

Glu 

Asn 

1 




5 





10 





15 


He 

Leu 

Lys 

Glu 

Lys 

Val 

Asp 

Lys 

lie 

Ala 

Ala 

Thr 

Gly 

Ala 

Asn 

Val 




20 





25 





30 



Val 

lie 

Ala 

Gin 

Lys 

Gly 

lie 

Asp 

Glu 

Val 

Ala 

Gin 

His 

Tyr 

Leu 

Ala 



35 





40 





45 




Lys 

Lys 

Gly 

lie 

Leu 

Ala 

Val 

Arg 

Arg 

Ala 

Lys 

Lys 

Ser 

Asp 

Leu 

Glu 


50 





55 





60 





Lys 

Leu 

Ala 

Arg 

Ala 

Thr 

Gly 

Gly 

Arg 

Val 

lie 

Ser 

Asn 

lie 

Asp 

Glu 

65 





70 





75 





80 

Leu 

Thr 

Ser 

Gin 

Asp 

Leu Gly 

Tyr 

Ala 

Ala 

Leu 

Val 

Glu 

Glu 

Arg 

Lys 





85 





90 





95 


Val 

Gly 

Glu 

Asp 

Lys 

lie 

Val 

Phe 

Val 

Glu 

Gly 

Ala 

Lys 

Asn 

Pro 

Lys 




100 





105 





110 



Ser 

Val 

Ser 

lie 

Leu 

lie 

Arg 

Gly 

Gly 

Leu 

Glu 

Arg 

Val 

Val 

Asp 

Glu 



115 





120 





125 




Thr 

Glu 

Arg 

Ala 

Leu 

Arg Asp 

Ala 

Leu 

Gly 

Thr 

Val 

Ala 

Asp 

Val 

lie 


130 





135 





140 





Arg Asp 

Gly 

Arg 

Ala 

Val 

Ala 

Gly 

Gly 

Gly 

Ala 

Val 

Glu 

lie 

Glu 

lie 

145 





150 





155 





160 

Ala 

Lys 

Arg 

Leu 

Arg 

Lys 

Tyr 

Ala 

Pro 

Gin 

Val 

Gly 

Gly 

Lys 

Glu 

Gin 





165 





170 





175 


Leu 

Ala 

lie 

Glu 

Ala 

Tyr 

Ala 

Asn 

Ala 

lie 

Glu 

Gly 

Leu 

lie 

Met 

lie 




180 





185 





190 




Leu Ala Glu Asn Ala Gly Leu Asp Pro lie Asp Lys Leu Met Gin Leu 
195 200 205 
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Arg 

Ser 

Leu 

His 

Glu 

Asn 

Glu 

Thr 

Asn 

Lys 

Trp 

Tyr 

Gly 

Leu 

Asn 

Leu 


210 





215 





220 





Phe 

Thr 

Gly 

Asn 

Pro 

Glu 

Asp 

Met 

Trp 

Lys 

Leu 

Gly 

Val 

lie 

Glu 

Pro 

225 





230 





235 





240 

Ala 

Leu 

Val 

Lys 

Met 

Asn 

Ala 

lie 

Lys 

Ala 

Ala 

Thr 

Glu 

Ala 

Val 

Thr 





245 





250 





255 


Leu 

Val 

Leu 

Arg 

lie 

Asp 

Asp 

lie 

Val 

Gly 

Gly 

Ser 

Gly 

Gly 

Thr 

lie 




260 





265 





270 



Pro 

Val 

He 

lie 

Leu 

Lys 

Glu Gly 

Ser 

Ser 

Arg 

Thr 

Tyr 

Gly 

Lys 

Glu 



275 





280 





285 




Ala 

Leu 

Arg 

Ala 

Asn 

lie 

Ala 

Ala 

Val 

Lys 

Ala 

lie 

Glu 

Glu 

Ala 

Leu 


290 





295 





300 





Lys 

Ser 

Thr 

Tyr 

Gly 

Pro 

Arg 

Gly 

Met 

Asp 

Lys 

lie 

Leu 

Val 

Asp 

Ser 

305 





310 





315 





320 

Leu 

Gly 

Asp 

lie 

Thr 

lie 

Thr 

Asn 

Asp 

Gly 

Ala 

Thr 

lie 

Leu 

Asp 

Lys 





325 





330 





335 


Met 

Asp 

Leu 

Gin 

His 

Pro 

Thr 

Gly 

Lys 

Leu 

Leu 

Val 

Gin 

lie 

Ala 

Lys 




340 





345 





350 



Gly 

Gin 

Asp 

Glu 

Glu 

Thr 

Ala 

Asp 

Gly 

Thr 

Lys 

Thr 

Ala 

Val 

lie 

Leu 



355 





360 





365 




Ala 

Gly 

Glu 

Leu 

Ala 

Lys 

Lys 

Ala 

Glu 

Asp 

Leu 

Leu 

Tyr 

Lys 

Glu 

lie 


370 





375 





380 





His 

Pro 

Thr 

lie 

lie 

Val 

Ser 

Gly 

Tyr 

Lys 

Lys 

Ala 

Glu 

Glu 

lie 

Ala 

385 





390 





395 





400 

Leu 

Lys 

Thr 

lie 

Gin 

Asp 

lie 

Ala 

Gin 

Pro 

Val 

Ser 

lie 

Asn 

Asp 

Thr 





405 





410 





415 


Asp 

Val 

Leu 

Arg 

Lys 

Val 

Ala 

Leu 

Thr 

Ser 

Leu 

Gly 

Ser 

Lys 

Ala 

Val 




420 





425 





430 



Ala 

Gly 

Ala 

Arg 

Glu 

Tyr 

Leu 

Ala 

Asp 

Leu 

Val 

Val 

Lys 

Ala 

Val 

Ala 



435 





440 





445 




Gin 

Val 

Ala 

Glu 

Leu 

Arg 

Gly Asp 

Lys 

Trp 

Tyr 

Val 

Asp 

Leu 

Asp 

Asn 


450 





455 





460 





Val 

Gin 

lie 

Val 

Lys 

Lys 

His 

Gly 

Gly 

Ser 

lie 

Asn 

Asp 

Thr 

Gin 

Leu 

465 





470 





475 





480 

Val 

Tyr 

Gly 

lie 

Val 

Val 

Asp 

Lys 

Glu 

Val 

Val 

His 

Pro 

Gly 

Met 

Pro 





485 





490 





495 


Lys 

Arg 

lie 

Glu 

Asn 

Ala 

Lys 

lie 

Ala 

Leu 

Leu 

Asp 

Ala 

Ser 

Leu 

Glu 




500 





505 





510 



Val 

Glu 

Lys 

Pro 

Glu 

Leu 

Asp 

Ala 

Glu 

lie 

Arg 

lie 

Asn 






515 





520 





525 





<210 > SEQ ID NO 70 
<211> LENGTH: 1575 
<212 > TYPE: DNA 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

<221> NAME/KEY: misc_f eature 

<223 > OTHER INFORMATION: chaperonin permutant 316 
<400 > SEQUENCE: 70 


atggcaatat 

tagctgttag 

gagagccaag 

aagagtgatt 

tagagaaatt 

agctagagct 

60 

accggaggta 

gagtcatatc 

aaatattgat 

gaattaactt 

cacaagatct 

aggttatgcc 

120 

gcattagtgg 

aagagagaaa 

agtaggtgaa 

gacaagatcg 

tattcgtaga 

aggtgcaaag 

180 

aatccaaaat 

cagttagtat 

actaataaga 

ggaggattag 

agagagtagt 

agatgagact 

240 
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132 


-continued 


gaaagagctc 

ttagggacgc 

tttaggtaca 

gtggcagatg 

taataaggga 

tggtagagca 

300 

gtagctggtg 

gtggagctgt 

tgagatagag 

atagctaaga 

gattaagaaa 

gtatgcccca 

360 

caagttggtg 

gtaaagagca 

attagcaatt 

gaagcatatg 

ctaatgcaat 

agagggtcto 

420 

attatgatat 

tggoggaaaa 

cgcaggatta 

gatcctatag 

acaaattaat 

gcaattaaga 

480 

agtcttcacg 

agaatgagac 

caataaatgg 

tatggactta 

atttatttac 

tggaaatcca 

540 

gaggatatgt 

ggaaattagg 

tgttattgaa 

ccggcactag 

tgaaaatgaa 

tgcaattaag 

600 

gctgcaacag 

aagcagtaac 

attagtgtta 

agaatagatg 

atattgtagg 

tggttctggt 

660 

ggtaccatac 

ctgtaataat 

tttaaaagag 

ggatcaagta 

gaacatatgg 

aaaagaagct 

720 

ttaagggcta 

atattgctgc 

agtgaaagca 

attgaagagg 

cattaaaaag 

cacctatggt 

780 

ccacgtggaa 

tggataagat 

tcttgttgat 

agcttaggag 

atattacaat 

aacaaatgat 

840 

ggagccacta 

ttcttgataa 

aatggattta 

caacacccaa 

caggtaagct 

tttagttcag 

900 

atagctaaag 

gacaagacga 

ggaaacagct 

gatggcacta 

aaactgctgt 

aattcttgct 

960 

ggagaattag 

ctaaaaaagc 

agaagatctt 

ttatataagg 

agattcaccc 

aacaataatt 

1020 

gtaagcggat 

ataagaaggc 

agaagaaatt 

gcattaaaga 

ccatccaaga 

tatagcacaa 

1080 

ccggtcagca 

taaatgatac 

tgacgtactt 

aggaaagtag 

cattaacatc 

cttaggcagt 

1140 

aaggcagtag 

caggcgcacg 

agagtattta 

gctgaccttg 

tggttaaagc 

agtggcacaa 

1200 

gtagcagaat 

taagaggaga 

taagtggtat 

gttgatctag 

ataatgtaca 

aatagttaaa 

1260 

aaacatggtg 

gtagcattaa 

tgatactcaa 

ttagtatacg 

gcatagtagt 

tgataaggaa 

1320 

gttgtacatc 

ogggoatg 00 

aaagaggatt 

gaaaatgcta 

agatagccct 

tttagacgct 

1380 

tcattagaag 

ttgagaaacc 

cgaattggat 

gcagaaataa 

gaattaacga 

tccaacacag 

1440 

atgcacaaat 

tcttggaaga 

agaagaaaac 

atattgaaag 

aaaaagtaga 

taagattgca 

1500 

gctactggtg 

ctaacgttgt 

aatagcgcag 

aaaggtatcg 

atgaagttgc 

acaacactat 

1560 


ttagctaaga aaggt 1575 

<210 > SEQ ID NO 71 
<211> LENGTH: 525 
<212 > TYPE : PRT 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

< 22 1 > NAME /KEY: MIS C_FEATURE 

<223 > OTHER INFORMATION: chaperonin permutant 316 
< 4 00 > SEQUENCE: 71 


Met 

Ala 

He 

Leu 

Ala 

Val 

Arg 

Arg 

Ala 

Lys 

Lys 

Ser 

Asp 

Leu 

Glu 

Lys 

1 




5 





10 





15 


Leu 

Ala 

Arg 

Ala 

Thr 

Gly Gly 

Arg 

Val 

lie 

Ser 

Asn 

lie 

Asp 

Glu 

Leu 




20 





25 





30 



Thr 

Ser 

Gin Asp 

Leu 

Gly 

Tyr 

Ala 

Ala 

Leu 

Val 

Glu 

Glu 

Arg 

Lys 

Val 



35 





40 





45 




Gly Glu Asp 

Lys 

lie 

Val 

Phe 

Val 

Glu 

Gly 

Ala 

Lys 

Asn 

Pro 

Lys 

Ser 


50 





55 





60 





Val 

Ser 

lie 

Leu 

lie 

Arg 

Gly 

Gly 

Leu 

Glu 

Arg 

Val 

Val 

Asp 

Glu 

Thr 

65 





70 





75 





80 

Glu Arg 

Ala 

Leu Arg 

Asp 

Ala 

Leu 

Gly 

Thr 

Val 

Ala 

Asp 

Val 

lie 

Arg 





85 





90 





95 


Asp 

Gly Arg Ala 

Val 

Ala 

Gly 

Gly 

Gly 

Ala 

Val 

Glu 

lie 

Glu 

lie 

Ala 




100 





105 





110 




Lys Arg Leu Arg Lys Tyr Ala Pro Gin Val Gly Gly Lys Glu Gin Leu 
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Ala He 
13 0 

Ala Glu 
145 

Ser Leu 


Thr Gly 


Leu Val 


Val Leu 
210 

Val lie 
225 

Leu Arg 

Ser Thr 

Gly Asp 

Asp Leu 
290 

Gin Asp 
305 

Gly Glu 
Pro Thr 


Lys Thr 

Val Leu 
370 

Gly Ala 
385 

Val Ala 

Gin lie 

Tyr Gly 

Arg lie 
450 

Glu Lys 
465 

Met His 
Asp Lys 
lie Asp 


115 

Glu Ala 

Asn Ala 

His Glu 

Asn Pro 
180 

Lys Met 
195 

Arg lie 

lie Leu 

Ala Asn 

Tyr Gly 
260 

lie Thr 
275 

Gin His 

Glu Glu 

Leu Ala 

lie lie 
340 

lie Gin 
355 

Arg Lys 

Arg Glu 

Glu Leu 

Val Lys 
420 

lie Val 
435 

Glu Asn 

Pro Glu 

Lys Phe 

lie Ala 
500 

Glu Val 
515 


Tyr Ala 

Gly Leu 
150 

Asn Glu 
165 

Glu Asp 

Asn Ala 

Asp Asp 

Lys Glu 
230 

lie Ala 
245 

Pro Arg 

lie Thr 

Pro Thr 

Thr Ala 
310 

Lys Lys 
325 

Val Ser 

Asp lie 

Val Ala 

Tyr Leu 
390 

Arg Gly 
405 

Lys His 

Val Asp 

Ala Lys 

Leu Asp 
470 

Leu Glu 
485 

Ala Thr 
Ala Gin 


120 

Asn Ala 
135 

Asp Pro 

Thr Asn 

Met Trp 

lie Lys 
200 

lie Val 
215 

Gly Ser 

Ala Val 

Gly Met 

Asn Asp 
280 

Gly Lys 
295 

Asp Gly 

Ala Glu 

Gly Tyr 

Ala Gin 
360 

Leu Thr 
375 

Ala Asp 

Asp Lys 

Gly Gly 

Lys Glu 
440 

lie Ala 
455 

Ala Glu 

Glu Glu 

Gly Ala 

His Tyr 
520 


lie Glu 

lie Asp 

Lys Trp 
170 

Lys Leu 
185 

Ala Ala 

Gly Gly 

Ser Arg 

Lys Ala 
250 

Asp Lys 
265 

Gly Ala 

Leu Leu 

Thr Lys 

Asp Leu 
330 

Lys Lys 
345 

Pro Val 

Ser Leu 

Leu Val 

Trp Tyr 
410 

Ser lie 
425 

Val Val 

Leu Leu 

lie Arg 

Glu Asn 
490 

Asn Val 
505 

Leu Ala 


Gly Leu 
140 

Lys Leu 
155 

Tyr Gly 

Gly Val 

Thr Glu 

Ser Gly 
220 

Thr Tyr 
235 

lie Glu 

lie Leu 

Thr lie 

Val Gin 
300 

Thr Ala 
315 

Leu Tyr 

Ala Glu 

Ser lie 

Gly Ser 
380 

Val Lys 
395 

Val Asp 

Asn Asp 

His Pro 

Asp Ala 
460 

lie Asn 
475 

lie Leu 
Val lie 
Lys Lys 


125 

lie Met 
Met Gin 
Leu Asn 


lie Glu 
190 

Ala Val 
205 

Gly Thr 

Gly Lys 

Glu Ala 

Val Asp 
270 

Leu Asp 
285 

lie Ala 
Val lie 


Lys Glu 


Glu lie 
350 

Asn Asp 
365 

Lys Ala 


Ala Val 


Leu Asp 

Thr Gin 
430 

Gly Met 
445 

Ser Leu 

Asp Pro 

Lys Glu 

Ala Gin 
510 

Gly 

525 


lie Leu 

Leu Arg 
160 

Leu Phe 
175 

Pro Ala 
Thr Leu 


lie Pro 


Glu Ala 
240 

Leu Lys 
255 

Ser Leu 

Lys Met 

Lys Gly 

Leu Ala 
320 

lie His 
335 

Ala Leu 

Thr Asp 

Val Ala 

Ala Gin 
400 

Asn Val 
415 

Leu Val 

Pro Lys 

Glu Val 

Thr Gin 
480 

Lys Val 
495 

Lys Gly 


<210 > SEQ ID NO 72 




135 


US 7,795,388 B2 


136 
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<211> LENGTH: 1575 
<212 > TYPE: DNA 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

<221> NAME/KEY: misc_feature 

<223 > OTHER INFORMATION: chaperonin permutant 480 
< 4 00 > SEQUENCE: 72 


atggcaaata 

aatggtatgg 

acttaattta 

tttactggaa 

atccagagga 

tatgtggaaa 

60 

ttaggtgtta 

ttgaaccggc 

actagtgaaa 

atgaatgcaa 

ttaaggctgc 

aacagaagca 

120 

gtaacattag 

tgttaagaat 

agatgatatt 

gtaggtggtt 

ctggtggtac 

catacctgta 

180 

ataattttaa 

aagagggatc 

aagtagaaca 

tatggaaaag 

aagctttaag 

ggctaatatt 

240 

gctgcagtga 

aagcaattga 

agaggcatta 

aaaagcacct 

atggtccacg 

tggaatggat 

300 

aagattcttg 

ttgatagctt 

aggagatatt 

acaataacaa 

atgatggagc 

cactattctt 

360 

gataaaatgg 

atttacaaca 

cccaacaggt 

aagcttttag 

ttcagatagc 

taaaggacaa 

420 

gacgaggaaa 

cagctgatgg 

cactaaaact 

gctgtaattc 

ttgctggaga 

attagctaaa 

480 

aaagcagaag 

atcttttata 

taaggagatt 

cacccaacaa 

taattgtaag 

cggatataag 

540 

aaggcagaag 

aaattgcatt 

aaagaccatc 

caagatatag 

cacaaccggt 

cagcataaat 

600 

gatactgacg 

tacttaggaa 

agtagcatta 

acatccttag 

gcagtaaggc 

agtagcaggc 

660 

gcacgagagt 

atttagctga 

ccttgtggtt 

aaagcagtgg 

cacaagtagc 

agaattaaga 

720 

ggagataagt 

ggtatgttga 

tctagataat 

gtacaaatag 

ttaaaaaaca 

tggtggtagc 

780 

attaatgata 

ctcaattagt 

atacggcata 

gtagttgata 

aggaagttgt 

acatccgggc 

840 

atgccaaaga 

ggattgaaaa 

tgctaagata 

gcccttttag 

acgcttcatt 

agaagttgag 

900 

aaacccgaat 

tggatgcaga 

aataagaatt 

aacgatccaa 

cacagatgca 

caaattcttg 

960 

gaagaagaag 

aaaacatatt 

gaaagaaaaa 

gtagataaga 

ttgcagctac 

tggtgctaac 

1020 

gttgtaatag 

cgcagaaagg 

tatcgatgaa 

gttgcacaac 

actatttagc 

taagaaaggt 

1080 

atattagctg 

ttaggagagc 

caagaagagt 

gatttagaga 

aattagctag 

agctaccgga 

1140 

ggtagagtca 

tatcaaatat 

tgatgaatta 

acttcacaag 

atctaggtta 

tgccgcatta 

1200 

gtggaagaga 

gaaaagtagg 

tgaagacaag 

atcgtattcg 

tagaaggtgc 

aaagaatcca 

1260 

aaatcagtta 

gtatactaat 

aagaggagga 

ttagagagag 

tagtagatga 

gactgaaaga 

1320 

gctcttaggg 

acgctttagg 

tacagtggca 

gatgtaataa 

gggatggtag 

agcagtagct 

1380 

ggtggtggag 

ctgttgagat 

agagatagct 

aagagattaa 

gaaagtatgc 

cccacaagtt 

1440 

ggtggtaaag 

agcaattagc 

aattgaagca 

tatgctaatg 

caatagaggg 

tctcattatg 

1500 

atattggcgg 

aaaacgcagg 

attagatcct 

atagacaaat 

taatgcaatt 

aagaagtctt 

1560 


cacgagaatg agacc 1575 


<210 > SEQ ID NO 73 
<211> LENGTH: 525 
<212 > TYPE: PRT 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

<221> NAME /KEY: MIS C_FE ATURE 

<223 > OTHER INFORMATION: chaperonin permutant 480 
<400 > SEQUENCE: 73 

Met Ala Asn Lys Trp Tyr Gly Leu Asn Leu Phe Thr Gly Asn Pro Glu 
15 10 15 

Asp Met Trp Lys Leu Gly Val lie Glu Pro Ala Leu Val Lys Met Asn 
20 25 30 
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Ala He 

Asp lie 
50 

Glu Gly 
65 

Ala Ala 

Arg Gly 

Thr Asn 

Thr Gly 
130 

Ala Asp 
145 

Lys Ala 
Ser Gly 


lie Ala 

Ala Leu 
210 

Leu Ala 
225 

Gly Asp 

His Gly 

Asp Lys 

Lys lie 
290 

Asp Ala 
305 

Glu Glu 


Thr Gly 


Gin His 

Lys Ser 
370 

Ser Asn 
385 

Val Glu 
Ala Lys 
Arg Val 


Lys Ala 
35 

Val Gly 

Ser Ser 

Val Lys 

Met Asp 
100 

Asp Gly 
115 

Lys Leu 

Gly Thr 

Glu Asp 

Tyr Lys 
180 

Gin Pro 
195 

Thr Ser 

Asp Leu 

Lys Trp 

Gly Ser 
260 

Glu Val 
275 

Ala Leu 

Glu lie 

Glu Glu 

Ala Asn 
340 

Tyr Leu 
355 

Asp Leu 

lie Asp 

Glu Arg 

Asn Pro 
420 

Val Asp 
435 


Ala Thr 

Gly Ser 

Arg Thr 
70 

Ala lie 
85 

Lys lie 

Ala Thr 

Leu Val 

Lys Thr 
150 

Leu Leu 
165 

Lys Ala 

Val Ser 

Leu Gly 

Val Val 
230 

Tyr Val 
245 

lie Asn 

Val His 

Leu Asp 

Arg lie 
310 

Asn lie 
325 

Val Val 

Ala Lys 

Glu Lys 

Glu Leu 
390 

Lys Val 
405 

Lys Ser 
Glu Thr 


Glu Ala 
40 

Gly Gly 
55 

Tyr Gly 

Glu Glu 

Leu Val 

lie Leu 
120 

Gin lie 
135 

Ala Val 

Tyr Lys 

Glu Glu 

lie Asn 
200 

Ser Lys 
215 

Lys Ala 

Asp Leu 

Asp Thr 

Pro Gly 
280 

Ala Ser 
295 

Asn Asp 

Leu Lys 

lie Ala 

Lys Gly 
360 

Leu Ala 
375 

Thr Ser 

Gly Glu 

Val Ser 

Glu Arg 
440 


Val Thr 

Thr lie 

Lys Glu 

Ala Leu 
90 

Asp Ser 
105 

Asp Lys 

Ala Lys 

lie Leu 

Glu lie 
170 

lie Ala 
185 

Asp Thr 

Ala Val 

Val Ala 

Asp Asn 
250 

Gin Leu 
265 

Met Pro 


Leu Glu 


Pro Thr 

Glu Lys 
330 

Gin Lys 
345 

lie Leu 

Arg Ala 

Gin Asp 

Asp Lys 
410 

lie Leu 
425 

Ala Leu 


Leu Val 


Pro Val 
60 

Ala Leu 
75 

Lys Ser 

Leu Gly 

Met Asp 

Gly Gin 
140 

Ala Gly 
155 

His Pro 

Leu Lys 

Asp Val 

Ala Gly 
220 

Gin Val 
235 

Val Gin 
Val Tyr 


Lys Arg 

Val Glu 
300 

Gin Met 
315 

Val Asp 

Gly lie 

Ala Val 

Thr Gly 
380 

Leu Gly 
395 

lie Val 
lie Arg 
Arg Asp 


Leu Arg 
45 

lie lie 

Arg Ala 

Thr Tyr 

Asp lie 
110 

Leu Gin 
125 

Asp Glu 

Glu Leu 

Thr lie 

Thr lie 
190 

Leu Arg 
205 

Ala Arg 

Ala Glu 

lie Val 

Gly lie 
270 

lie Glu 
285 

Lys Pro 

His Lys 

Lys lie 

Asp Glu 
350 

Arg Arg 
365 

Gly Arg 

Tyr Ala 

Phe Val 

Gly Gly 
430 

Ala Leu 
445 


lie Asp 

Leu Lys 

Asn lie 
80 

Gly Pro 
95 

Thr lie 


His Pro 


Glu Thr 


Ala Lys 
160 

lie Val 
175 

Gin Asp 


Lys Val 


Glu Tyr 


Leu Arg 
240 

Lys Lys 
255 

Val Val 


Asn Ala 


Glu Leu 

Phe Leu 
320 

Ala Ala 
335 

Val Ala 

Ala Lys 

Val lie 

Ala Leu 
400 

Glu Gly 
415 

Leu Glu 
Gly Thr 
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Val 

Ala 

Asp 

Val 

lie 

Arg Asp 

Gly Arg 

Ala 

Val 

Ala 

Gly 

Gly 

Gly 

Ala 


450 





455 





460 





Val 

Glu 

He 

Glu 

lie 

Ala 

Lys 

Arg 

Leu 

Arg 

Lys 

Tyr 

Ala 

Pro 

Gin 

Val 

465 





470 





475 





480 

Gly Gly 

Lys 

Glu 

Gin 

Leu 

Ala 

lie 

Glu 

Ala 

Tyr 

Ala 

Asn 

Ala 

lie 

Glu 





485 





490 





495 


Gly 

Leu 

lie 

Met 

lie 

Leu 

Ala 

Glu 

Asn 

Ala 

Gly 

Leu 

Asp 

Pro 

lie 

Asp 




500 





505 





510 



Lys 

Leu 

Met 

Gin 

Leu Arg 

Ser 

Leu 

His 

Glu 

Asn 

Glu 

Thr 






515 





520 





525 





<210 > SEQ ID NO 74 
< 2 11 > LENGTH: 1575 
<212 > TYPE: DNA 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

< 22 1 > NAME/KEY: misc_f eature 

<223 > OTHER INFORMATION: chaperonin permutant 499 
< 4 00 > SEQUENCE: 74 


atggcattag 

gtgttattga 

accggcacta 

gtgaaaatga 

atgcaattaa 

ggctgcaaca 

60 

gaagcagtaa 

cattagtgtt 

aagaatagat 

gatattgtag 

gtggttctgg 

tggtaccata 

120 

cctgtaataa 

ttttaaaaga 

gggatcaagt 

agaacatatg 

gaaaagaagc 

tttaagggct 

180 

aatattgctg 

cagtgaaagc 

aattgaagag 

gcattaaaaa 

gcacctatgg 

tccacgtgga 

240 

atggataaga 

ttcttgttga 

tagcttagga 

gatattacaa 

taacaaatga 

tggagccact 

300 

attcttgata 

aaatggattt 

acaacaccca 

acaggtaagc 

ttttagttca 

gatagctaaa 

360 

ggacaagacg 

aggaaacagc 

tgatggcact 

aaaactgctg 

taattcttgc 

tggagaatta 

420 

gctaaaaaag 

cagaagatct 

tttatataag 

gagattcacc 

caacaataat 

tgtaagcgga 

480 

tataagaagg 

cagaagaaat 

tgcattaaag 

accatccaag 

atatagcaca 

accggtcagc 

540 

ataaatgata 

ctgacgtact 

taggaaagta 

gcattaacat 

ccttaggcag 

taaggcagta 

600 

gcaggcgcac 

gagagtattt 

agctgacctt 

gtggttaaag 

cagtggcaca 

agtagcagaa 

660 

ttaagaggag 

ataagtggta 

tgttgatcta 

gataatgtac 

aaatagttaa 

aaaacatggt 

720 

ggtagcatta 

atgatactca 

attagtatac 

ggcatagtag 

ttgataagga 

agttgtacat 

780 

cc 999 cat 9 c 

caaagaggat 

tgaaaatgct 

aagatagccc 

ttttagacgc 

ttcattagaa 

840 

gttgagaaac 

ccgaattgga 

tgcagaaata 

agaattaacg 

atccaacaca 

gatgcacaaa 

900 

ttcttggaag 

aagaagaaaa 

catattgaaa 

gaaaaagtag 

ataagattgc 

agctactggt 

960 

gctaacgttg 

taatagcgca 

gaaaggtatc 

gatgaagttg 

cacaacacta 

tttagctaag 

1020 

aaaggtatat 

tagctgttag 

gagagccaag 

aagagtgatt 

tagagaaatt 

agctagagct 

1080 

accggaggta 

gagtcatatc 

aaatattgat 

gaattaactt 

cacaagatct 

aggttatgcc 

1140 

gcattagtgg 

aagagagaaa 

agtaggtgaa 

gacaagatcg 

tattcgtaga 

aggtgcaaag 

1200 

aatccaaaat 

cagttagtat 

actaataaga 

ggaggattag 

agagagtagt 

agatgagact 

1260 

gaaagagctc 

ttagggacgc 

tttaggtaca 

gtggcagatg 

taataaggga 

tggtagagca 

1320 

gtagctggtg 

gtggagctgt 

tgagatagag 

atagctaaga 

gattaagaaa 

gtatgcccca 

1380 

caagttggtg 

gtaaagagca 

attagcaatt 

gaagcatatg 

ctaatgcaat 

agagggtctc 

1440 

attatgatat 

tggoggaaaa 

cgcaggatta 

gatcctatag 

acaaattaat 

gcaattaaga 

1500 

agtcttcacg 

agaatgagac 

caataaatgg 

tatggactta 

atttatttac 

tggaaatcca 

1560 


gaggatatgt ggaaa 


1575 
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<210 > SEQ ID NO 75 
<211> LENGTH: 525 
<212 > TYPE: PRT 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

<221 > NAME /KEY: MIS C_FEATURE 

<223 > OTHER INFORMATION: chaperonin permutant 499 
< 4 00 > SEQUENCE: 75 


Met 

Ala 

Leu 

Gly 

Val 

lie 

Glu 

Pro 

Ala 

Leu 

Val 

Lys 

Met 

Asn 

Ala 

lie 

1 




5 





10 





15 


Lys 

Ala 

Ala 

Thr 

Glu 

Ala 

Val 

Thr 

Leu 

Val 

Leu 

Arg 

lie 

Asp 

Asp 

lie 




20 





25 





30 



Val 

Gly 

Gly 

Ser 

Gly 

Gly 

Thr 

lie 

Pro 

Val 

lie 

lie 

Leu 

Lys 

Glu 

Gly 



35 





40 





45 




Ser 

Ser 

Arg 

Thr 

Tyr 

Gly 

Lys 

Glu 

Ala 

Leu 

Arg 

Ala 

Asn 

lie 

Ala 

Ala 


50 





55 





60 





Val 

Lys 

Ala 

lie 

Glu 

Glu 

Ala 

Leu 

Lys 

Ser 

Thr 

Tyr 

Gly 

Pro 

Arg 

Gly 

65 





70 





75 





80 

Met 

Asp 

Lys 

lie 

Leu 

Val 

Asp 

Ser 

Leu 

Gly 

Asp 

lie 

Thr 

lie 

Thr 

Asn 





85 





90 





95 


Asp 

Gly 

Ala 

Thr 

lie 

Leu 

Asp 

Lys 

Met 

Asp 

Leu 

Gin 

His 

Pro 

Thr 

Gly 




100 





105 





110 



Lys 

Leu 

Leu 

Val 

Gin 

lie 

Ala 

Lys 

Gly 

Gin 

Asp 

Glu 

Glu 

Thr 

Ala 

Asp 



115 





120 





125 




Gly 

Thr 

Lys 

Thr 

Ala 

Val 

lie 

Leu 

Ala 

Gly 

Glu 

Leu 

Ala 

Lys 

Lys 

Ala 


130 





135 





140 





Glu 

Asp 

Leu 

Leu 

Tyr 

Lys 

Glu 

lie 

His 

Pro 

Thr 

lie 

lie 

Val 

Ser 

Gly 

145 





150 





155 





160 

Tyr 

Lys 

Lys 

Ala 

Glu 

Glu 

lie 

Ala 

Leu 

Lys 

Thr 

lie 

Gin 

Asp 

lie 

Ala 





165 





170 





175 


Gin 

Pro 

Val 

Ser 

lie 

Asn 

Asp 

Thr 

Asp 

Val 

Leu 

Arg 

Lys 

Val 

Ala 

Leu 




180 





185 





190 



Thr 

Ser 

Leu 

Gly 

Ser 

Lys 

Ala 

Val 

Ala 

Gly 

Ala 

Arg 

Glu 

Tyr 

Leu 

Ala 



195 





200 





205 




Asp 

Leu 

Val 

Val 

Lys 

Ala 

Val 

Ala 

Gin 

Val 

Ala 

Glu 

Leu 

Arg 

Gly 

Asp 


210 





215 





220 





Lys 

Trp 

Tyr 

Val 

Asp 

Leu 

Asp 

Asn 

Val 

Gin 

lie 

Val 

Lys 

Lys 

His 

Gly 

225 





230 





235 





240 

Gly 

Ser 

lie 

Asn 

Asp 

Thr 

Gin 

Leu 

Val 

Tyr 

Gly 

lie 

Val 

Val 

Asp 

Lys 





245 





250 





255 


Glu 

Val 

Val 

His 

Pro 

Gly 

Met 

Pro 

Lys 

Arg 

lie 

Glu 

Asn 

Ala 

Lys 

lie 




260 





265 





270 



Ala 

Leu 

Leu 

Asp 

Ala 

Ser 

Leu 

Glu 

Val 

Glu 

Lys 

Pro 

Glu 

Leu 

Asp 

Ala 



275 





280 





285 




Glu 

He 

Arg 

lie 

Asn 

Asp 

Pro 

Thr 

Gin 

Met 

His 

Lys 

Phe 

Leu 

Glu 

Glu 


290 





295 





300 





Glu 

Glu 

Asn 

lie 

Leu 

Lys 

Glu 

Lys 

Val 

Asp 

Lys 

lie 

Ala 

Ala 

Thr 

Gly 

305 





310 





315 





320 

Ala 

Asn 

Val 

Val 

lie 

Ala 

Gin 

Lys 

Gly 

lie 

Asp 

Glu 

Val 

Ala 

Gin 

His 





325 





330 





335 


Tyr 

Leu 

Ala 

Lys 

Lys 

Gly 

lie 

Leu 

Ala 

Val 

Arg 

Arg 

Ala 

Lys 

Lys 

Ser 




340 





345 





350 




Asp Leu Glu Lys Leu Ala Arg Ala Thr Gly Gly Arg Val lie Ser Asn 
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355 360 365 

lie Asp Glu Leu Thr Ser Gin Asp Leu Gly Tyr Ala Ala Leu Val Glu 
370 375 380 

Glu Arg Lys Val Gly Glu Asp Lys lie Val Phe Val Glu Gly Ala Lys 

385 390 395 400 

Asn Pro Lys Ser Val Ser lie Leu lie Arg Gly Gly Leu Glu Arg Val 

405 410 415 

Val Asp Glu Thr Glu Arg Ala Leu Arg Asp Ala Leu Gly Thr Val Ala 
420 425 430 

Asp Val lie Arg Asp Gly Arg Ala Val Ala Gly Gly Gly Ala Val Glu 
435 440 445 

lie Glu lie Ala Lys Arg Leu Arg Lys Tyr Ala Pro Gin Val Gly Gly 
450 455 460 

Lys Glu Gin Leu Ala lie Glu Ala Tyr Ala Asn Ala lie Glu Gly Leu 

465 470 475 480 

lie Met lie Leu Ala Glu Asn Ala Gly Leu Asp Pro lie Asp Lys Leu 

485 490 495 

Met Gin Leu Arg Ser Leu His Glu Asn Glu Thr Asn Lys Trp Tyr Gly 
500 505 510 

Leu Asn Leu Phe Thr Gly Asn Pro Glu Asp Met Trp Lys 
515 520 525 


< 2 10 > SEQ ID NO 76 
<211 > LENGTH: 2349 
<212 > TYPE: DNA 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

< 22 1 > NAME/KEY: misc_feature 

<223 > OTHER INFORMATION: chaperonin permutant 267-EYFP fusion 
< 4 00 > SEQUENCE: 76 

atggcagatc caacacagat gcacaaattc ttggaagaag aagaaaacat attgaaagaa 60 

aaagtagata agattgcagc tactggtgct aacgttgtaa tagcgcagaa aggtatcgat 120 

gaagttgcac aacactattt agctaagaaa ggtatattag ctgttaggag agccaagaag 180 

agtgatttag agaaattagc tagagctacc ggaggtagag tcatatcaaa tattgatgaa 240 

ttaacttcac aagatctagg ttatgccgca ttagtggaag agagaaaagt aggtgaagac 300 

aagatcgtat tcgtagaagg tgcaaagaat ccaaaatcag ttagtatact aataagagga 360 

ggattagaga gagtagtaga tgagactgaa agagctctta gggacgcttt aggtacagtg 420 

gcagatgtaa taagggatgg tagagcagta gctggtggtg gagctgttga gatagagata 480 

gctaagagat taagaaagta tgccccacaa gttggtggta aagagcaatt agcaattgaa 540 

gcatatgcta atgcaataga gggtctcatt atgatattgg cggaaaacgc aggattagat 600 

cctatagaca aattaatgca attaagaagt cttcacgaga atgagaccaa taaatggtat 660 

ggacttaatt tatttactgg aaatccagag gatatgtgga aattaggtgt tattgaaccg 720 

gcactagtga aaatgaatgc aattaaggct gcaacagaag cagtaacatt agtgttaaga 780 

atagatgata ttgtaggtgg ttctggtggt accatacctg taataatttt aaaagaggga 840 

tcaagtagaa catatggaaa agaagcttta agggctaata ttgctgcagt gaaagcaatt 900 

gaagaggcat taaaaagcac ctatggtcca cgtggaatgg ataagattct tgttgatagc 960 

ttaggagata ttacaataac aaatgatgga gccactattc ttgataaaat ggatttacaa 1020 


cacccaacag gtaagctttt agttcagata gctaaaggac aagacgagga aacagctgat 1080 
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ggcactaaaa 

ctgctgtaat 

tcttgctgga 

gaattagcta 

aaaaagcaga 

agatctttta 

tataaggaga 

ttcacccaac 

aataattgta 

agcggatata 

agaaggcaga 

agaaattgca 

ttaaagacca 

tccaagatat 

agcacaaccg 

gtcagcataa 

atgatactga 

cgtacttagg 

aaagtagcat 

taacatcctt 

aggcagtaag 

gcagtagcag 

gcgcacgaga 

gtatttagct 

gaccttgtgg 

ttaaagcagt 

ggcacaagta 

gcagaattaa 

gaggagataa 

gtggtatgtt 

gatctagata 

atgtacaaat 

agttaaaaaa 

catggtggta 

gcattaatga 

tactcaatta 

gtatacggca 

tagtagttga 

taaggaagtt 

gtacatccgg 

gcatgccaaa 

gaggattgaa 

aatgctaaga 

tagccctttt 

agacgcttca 

ttagaagttg 

agaaacccga 

attggatgca 

gaaataagaa 

ttaacggcag 

cggcggatcc 

ggggtgagea 

agggegagga 

gctgttcacc 

ggggtggtgc 

ccatcctggt 

cgagctggac 

ggcgacgtaa 

acggccacaa 

gttcagcgtg 

tccggcgagg 

gcgagggcga 

tgccacctac 

ggcaagctga 

ccctgaagtt 

catctgcacc 

accggcaagc 

tgcccgtgcc 

ctggcccacc 

ctcgtgacca 

ccttcggcta 

cggcctgcag 

tgcttcgccc 

gctaccccga 

ccacatgaag 

cagcacgact 

tcttcaagtc 

cgccatgccc 

gaaggctacg 

tccaggagcg 

caccatcttc 

ttcaaggacg 

acggcaacta 

caagacccgc 

gccgaggtga 

agttcgaggg 

cgacaccctg 

gtgaaccgca 

tcgagctgaa 

gggcatcgac 

ttcaaggagg 

acggcaacat 

cctggggcac 

aagctggagt 

acaacggcgg 

taccggatct 

ggaggtgagc: 

tcaacagcca 

caacgtctat 

atcatggccg 

acaagcagaa 

gaacggcatc 

aaggtgaact 

tcaagatccg 

ccacaacatc 

gaggacggca 

gcgtgcagct 

cgccgaccac 

taccagcaga 

acacccccat 

cggcgacggc 

cccgtgctgc 

tgcccgacaa 

ccactacctg 

agctaccagt 

ccgccctgag 

caaagacccc 

aacgagaagc 

gcgatcacat 

ggtcctgctg 

gagttcgtga 

ccgccgccgg 

gatcactctc 

ggcatggacg 

agctgtacaa 

gcaccaccat 


caccatcac 

<210 > SEQ ID NO 77 
<211> LENGTH: 783 
<212 > TYPE : PRT 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

<22 1> NAME /KEY: MIS C_FE ATURE 

<223> OTHER INFORMATION: chaperonin permutant 267-EYFP fusion 
< 4 00 > SEQUENCE: 77 


Met 

Ala 

Asp 

Pro 

Thr 

Gin 

Met 

His 

Lys 

Phe 

Leu 

Glu 

Glu 

Glu 

Glu 

Asn 

1 




5 





10 





15 


He 

Leu 

Lys 

Glu 

Lys 

Val 

Asp 

Lys 

lie 

Ala 

Ala 

Thr 

Gly 

Ala 

Asn 

Val 




20 





25 





30 



Val 

lie 

Ala 

Gin 

Lys 

Gly 

lie 

Asp 

Glu 

Val 

Ala 

Gin 

His 

Tyr 

Leu 

Ala 



35 





40 





45 




Lys 

Lys 

Gly 

lie 

Leu 

Ala 

Val 

Arg 

Arg 

Ala 

Lys 

Lys 

Ser 

Asp 

Leu 

Glu 


50 





55 





60 





Lys 

Leu 

Ala 

Arg 

Ala 

Thr 

Gly 

Gly 

Arg 

Val 

lie 

Ser 

Asn 

lie 

Asp 

Glu 

65 





70 





75 





80 

Leu 

Thr 

Ser 

Gin 

Asp 

Leu Gly 

Tyr 

Ala 

Ala 

Leu 

Val 

Glu 

Glu 

Arg 

Lys 





85 





90 





95 


Val 

Gly Glu Asp 

Lys 

lie 

Val 

Phe 

Val 

Glu 

Gly 

Ala 

Lys 

Asn 

Pro 

Lys 




100 





105 





110 



Ser 

Val 

Ser 

lie 

Leu 

lie 

Arg 

Gly 

Gly 

Leu 

Glu 

Arg 

Val 

Val 

Asp 

Glu 



115 





120 





125 





1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2349 
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Thr Glu 
13 0 

Arg Asp 
145 

Ala Lys 

Leu Ala 

Leu Ala 

Arg Ser 
210 

Phe Thr 
225 

Ala Leu 


Leu Val 


Pro Val 


Ala Leu 
290 

Lys Ser 
305 

Leu Gly 

Met Asp 

Gly Gin 

Ala Gly 
370 

His Pro 
385 

Leu Lys 

Asp Val 

Ala Gly 

Gin Val 
450 

Val Gin 
465 

Val Tyr 

Lys Arg 

Val Glu 

Gly Ser 
530 

lie Leu 


Arg Ala 

Gly Arg 

Arg Leu 

He Glu 
180 

Glu Asn 
195 

Leu His 

Gly Asn 

Val Lys 

Leu Arg 
260 

lie lie 
275 

Arg Ala 

Thr Tyr 

Asp lie 

Leu Gin 
340 

Asp Glu 
355 

Glu Leu 

Thr lie 

Thr lie 

Leu Arg 
420 

Ala Arg 
435 

Ala Glu 

lie Val 

Gly lie 

lie Glu 
500 

Lys Pro 
515 

Gly Val 
Val Glu 


Leu Arg 

Ala Val 
150 

Arg Lys 
165 

Ala Tyr 

Ala Gly 

Glu Asn 

Pro Glu 
230 

Met Asn 
245 

lie Asp 

Leu Lys 

Asn lie 

Gly Pro 
310 

Thr lie 
325 

His Pro 

Glu Thr 

Ala Lys 

lie Val 
390 

Gin Asp 
405 

Lys Val 

Glu Tyr 

Leu Arg 

Lys Lys 
470 

Val Val 
485 

Asn Ala 
Glu Leu 
Ser Lys 
Leu Asp 


Asp Ala 
135 

Ala Gly 

Tyr Ala 

Ala Asn 

Leu Asp 
200 

Glu Thr 
215 

Asp Met 

Ala lie 

Asp lie 

Glu Gly 
280 

Ala Ala 
295 

Arg Gly 

Thr Asn 

Thr Gly 

Ala Asp 
360 

Lys Ala 
375 

Ser Gly 

lie Ala 

Ala Leu 

Leu Ala 
440 

Gly Asp 
455 

His Gly 

Asp Lys 

Lys lie 

Asp Ala 
520 

Gly Glu 
535 

Gly Asp 


Leu Gly 

Gly Gly 

Pro Gin 
170 

Ala lie 
185 

Pro lie 

Asn Lys 

Trp Lys 

Lys Ala 
250 

Val Gly 
265 

Ser Ser 

Val Lys 

Met Asp 

Asp Gly 
330 

Lys Leu 
345 

Gly Thr 

Glu Asp 

Tyr Lys 

Gin Pro 
410 

Thr Ser 
425 

Asp Leu 

Lys Trp 

Gly Ser 

Glu Val 
490 

Ala Leu 
505 

Glu lie 
Glu Leu 
Val Asn 


Thr Val 
140 

Ala Val 
155 

Val Gly 

Glu Gly 

Asp Lys 

Trp Tyr 
220 

Leu Gly 
235 

Ala Thr 
Gly Ser 


Arg Thr 


Ala lie 
300 

Lys lie 
315 

Ala Thr 


Leu Val 


Lys Thr 


Leu Leu 
380 

Lys Ala 
395 

Val Ser 

Leu Gly 

Val Val 

Tyr Val 
460 

lie Asn 
475 

Val His 

Leu Asp 

Arg lie 

Phe Thr 
540 

Gly His 


Ala Asp 

Glu lie 

Gly Lys 

Leu lie 
190 

Leu Met 
205 

Gly Leu 

Val lie 

Glu Ala 

Gly Gly 
270 

Tyr Gly 
285 

Glu Glu 

Leu Val 

lie Leu 

Gin lie 
350 

Ala Val 
365 

Tyr Lys 

Glu Glu 

lie Asn 

Ser Lys 
430 

Lys Ala 
445 

Asp Leu 

Asp Thr 

Pro Gly 

Ala Ser 
510 

Asn Gly 
525 

Gly Val 
Lys Phe 


Val lie 

Glu lie 
160 

Glu Gin 
175 

Met lie 
Gin Leu 


Asn Leu 


Glu Pro 
240 

Val Thr 
255 

Thr lie 


Lys Glu 


Ala Leu 


Asp Ser 
320 

Asp Lys 
335 

Ala Lys 


lie Leu 


Glu lie 


lie Ala 
400 

Asp Thr 
415 

Ala Val 


Val Ala 


Asp Asn 

Gin Leu 
480 

Met Pro 
495 

Leu Glu 
Ser Gly 
Val Pro 
Ser Val 
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545 550 555 560 


Ser 

Gly 

Glu 

Gly 

Glu 

Gly 

Asp 

Ala 

Thr 

Tyr 

Gly 

Lys 

Leu 

Thr 

Leu 

Lys 





565 





570 





575 


Phe 

He 

Cys 

Thr 

Thr 

Gly 

Lys 

Leu 

Pro 

Val 

Pro 

Trp 

Pro 

Thr 

Leu 

Val 




580 





585 





590 



Thr 

Thr 

Phe 

Gly 

Tyr 

Gly 

Leu 

Gin 

Cys 

Phe 

Ala 

Arg 

Tyr 

Pro 

Asp 

His 



595 





600 





605 




Met 

Lys 

Gin 

His 

Asp 

Phe 

Phe 

Lys 

Ser 

Ala 

Met 

Pro 

Glu 

Gly 

Tyr 

Val 


610 





615 





620 





Gin 

Glu 

Arg 

Thr 

lie 

Phe 

Phe 

Lys 

Asp 

Asp 

Gly 

Asn 

Tyr 

Lys 

Thr 

Arg 

625 





630 





635 





640 

Ala 

Glu 

Val 

Lys 

Phe 

Glu 

Gly 

Asp 

Thr 

Leu 

Val 

Asn 

Arg 

lie 

Glu 

Leu 





645 





650 





655 


Lys 

Gly 

lie 

Asp 

Phe 

Lys 

Glu 

Asp 

Gly 

Asn 

lie 

Leu 

Gly 

His 

Lys 

Leu 




660 





665 





670 



Glu 

Tyr 

Asn 

Gly 

Gly 

Thr 

Gly 

Ser 

Gly 

Gly 

Glu 

Leu 

Asn 

Ser 

His 

Asn 



675 





680 





685 




Val 

Tyr 

lie 

Met 

Ala 

Asp 

Lys 

Gin 

Lys 

Asn 

Gly 

lie 

Lys 

Val 

Asn 

Phe 


690 





695 





700 





Lys 

lie 

Arg 

His 

Asn 

lie 

Glu 

Asp 

Gly 

Ser 

Val 

Gin 

Leu 

Ala 

Asp 

His 

705 





710 





715 





720 

Tyr 

Gin 

Gin 

Asn 

Thr 

Pro 

lie 

Gly 

Asp 

Gly 

Pro 

Val 

Leu 

Leu 

Pro 

Asp 





725 





730 





735 


Asn 

His 

Tyr 

Leu 

Ser 

Tyr 

Gin 

Ser 

Ala 

Leu 

Ser 

Lys 

Asp 

Pro 

Asn 

Glu 




740 





745 





750 



Lys 

Arg 

Asp 

His 

Met 

Val 

Leu 

Leu 

Glu 

Phe 

Val 

Thr 

Ala 

Ala 

Gly 

lie 



755 





760 





765 




Thr 

Leu 

Gly 

Met 

Asp 

Glu 

Leu 

Tyr 

Lys 

His 

His 

His 

His 

His 

His 



770 





775 





780 






<210 > SEQ ID NO 78 
<211> LENGTH: 2355 
<212 > TYPE: DNA 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

<221> NAME/KEY: misc_f eature 

<223> OTHER INFORMATION: chaperonin permutant 480-EYFP fusion 
< 4 00 > SEQUENCE: 78 


atggcaaata 

aatggtatgg 

acttaattta 

tttactggaa 

atccagagga 

tatgtggaaa 

60 

ttaggtgtta 

ttgaaccggc 

actagtgaaa 

atgaatgcaa 

ttaaggctgc 

aacagaagca 

120 

gtaacattag 

tgttaagaat 

agatgatatt 

gtaggtggtt 

ctggtggtac 

catacctgta 

180 

ataattttaa 

aagagggatc 

aagtagaaca 

tatggaaaag 

aagctttaag 

ggctaatatt 

240 

gctgcagtga 

aagcaattga 

agaggoatta 

aaaagcacct 

atggtccacg 

tggaatggat 

300 

aagattcttg 

ttgatagctt 

aggagatatt 

acaataacaa 

atgatggagc 

cactattctt 

360 

gataaaatgg 

atttacaaca 

cccaacaggt 

aagcttttag 

ttcagatagc 

taaaggacaa 

420 

gacgaggaaa 

cagctgatgg 

cactaaaact 

gctgtaattc 

ttgctggaga 

attagctaaa 

480 

aaagcagaag 

atcttttata 

taaggagatt 

cacccaacaa 

taattgtaag 

cggatataag 

540 

aaggcagaag 

aaattgcatt 

aaagaccatc 

caagatatag 

cacaaccggt 

cagcataaat 

600 

gatactgacg 

tacttaggaa 

agtagcatta 

acatccttag 

gcagtaaggc 

agtagcaggc 

660 

gcacgagagt 

atttagctga 

ccttgtggtt 

aaagcagtgg 

cacaagtagc 

agaattaaga 

720 
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ggagataagt 

ggtatgttga 

tctagataat 

gtacaaatag 

ttaaaaaaca 

tggtggtago 

780 

attaatgata 

ctcaattagt 

atacggcata 

gtagttgata 

aggaagttgt 

acatccgggc 

840 

atgccaaaga 

ggattgaaaa 

tgctaagata 

gcccttttag 

acgcttcatt 

agaagttgag 

900 

aaacccgaat 

tggatgcaga 

aataagaatt 

aacgatccaa 

cacagatgca 

caaattcttg 

960 

gaagaagaag 

aaaacatatt 

gaaagaaaaa 

gtagataaga 

ttgcagctac 

tggtgctaac 

1020 

gttgtaatag 

cgcagaaagg 

tatcgatgaa 

gttgcacaac 

actatttagc 

taagaaaggt 

1080 

atattagctg 

ttaggagagc 

caagaagagt 

gatttagaga 

aattagctag 

agctaccgga 

1140 

ggtagagtca 

tatcaaatat 

tgatgaatta 

acttcacaag 

atctaggtta 

tgccgcatta 

1200 

gtggaagaga 

gaaaagtagg 

tgaagacaag 

atcgtattcg 

tagaaggtgc 

aaagaatcca 

1260 

aaatcagtta 

gtatactaat 

aagaggagga 

ttagagagag 

tagtagatga 

gactgaaaga 

1320 

gctcttaggg 

acgctttagg 

tacagtggca 

gatgtaataa 

gggatggtag 

agcagtagct 

1380 

ggtggtggag 

ctgttgagat 

agagatagct 

aagagattaa 

gaaagtatgc 

cccacaagtt 

1440 

ggtggtaaag 

agcaattagc 

aattgaagca 

tatgctaatg 

caatagaggg 

tctcattatg 

1500 

atattggcgg 

aaaacgcagg 

attagatcct 

atagacaaat 

taatgcaatt 

aagaagtctt 

1560 

cacgagaatg 

agaccggctc 

tggcggatco 

ggatccgggg 

tgagcaaggg 

ogaggagotg 

1620 

ttcaccgggg 

tggtg cc cat 

cctggtcgag 

ctggacggcg 

acgtaaacgg 

ccacaagttc 

1680 

agcgtgtccg 

gcgagggcga 

gggogatgoc 

acctacggca 

agctgaccct 

gaagttcatc 

1740 

tgcaccaccg 

gcaagctgcc 

cgtgccctgg 

cccaccctcg 

tgaccacctt 

cggctacggc 

1800 

ctgcagtgct 

tcgcccgcta 

ccccgaccac 

atgaagcagc 

acgacttctt 

caagtccgcc 

1860 

atgcccgaag 

gctacgtcca 

ggagcgcacc 

atcttcttca 

aggacgacgg 

caactacaag 

1920 

acccgcgccg 

aggtgaagtt 

cgagggogao 

accctggtga 

accgcatcga 

gctgaagggc 

1980 

atcgacttca 

aggaggaegg 

caacatcctg 

gggcacaagc 

tggagtaoaa 

cggoggtaoc 

2040 

ggatctggag 

gtgagctcaa 

cagccacaac 

gtctatatca 

tggccgacaa 

gcagaagaac 

2100 

ggcatcaagg 

tgaacttcaa 

gatccgccac 

aacatcgagg 

acggcagcgt 

gcagctcgcc 

2160 

gaccactacc 

agcagaacac 

ccccatcggc 

gacggccccg 

tgctgctgcc 

cgacaaccac 

2220 

tacctgagct 

accagtccgc 

cctgagcaaa 

gaccccaacg 

agaagcgcga 

tcacatggtc 

2280 

ctgctggagt 

tcgtgaccgc 

cgccgggatc 

actctcggca 

tggacgagct 

gtacaagcac 

2340 

caccatcacc 

atcac 





2355 


<210 > SEQ ID NO 79 
<211> LENGTH: 785 
<212 > TYPE: PRT 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

<221> NAME /KEY: MIS C_FEATURE 

<223 > OTHER INFORMATION: chaperonin permutant 480-EYFP fusion 
<400 > SEQUENCE: 79 

Met Ala Asn Lys Trp Tyr Gly Leu Asn Leu Phe Thr Gly Asn Pro Glu 
15 10 15 

Asp Met Trp Lys Leu Gly Val lie Glu Pro Ala Leu Val Lys Met Asn 
20 25 30 

Ala lie Lys Ala Ala Thr Glu Ala Val Thr Leu Val Leu Arg lie Asp 
35 40 45 

Asp He Val Gly Gly Ser Gly Gly Thr lie Pro Val lie lie Leu Lys 
50 55 60 
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Glu Gly 

Ser 

Ser 

Arg 

Thr 

Tyr 

Gly 

Lys 

Glu 

Ala 

Leu 

Arg 

Ala 

Asn 

lie 

65 





70 





75 





80 

Ala 

Ala 

Val 

Lys 

Ala 

lie 

Glu 

Glu 

Ala 

Leu 

Lys 

Ser 

Thr 

Tyr 

Gly 

Pro 





85 





90 





95 


Arg 

Gly 

Met 

Asp 

Lys 

lie 

Leu 

Val 

Asp 

Ser 

Leu 

Gly 

Asp 

lie 

Thr 

lie 




100 





105 





110 



Thr 

Asn 

Asp 

Gly 

Ala 

Thr 

lie 

Leu 

Asp 

Lys 

Met 

Asp 

Leu 

Gin 

His 

Pro 



115 





120 





125 




Thr 

Gly 

Lys 

Leu 

Leu 

Val 

Gin 

lie 

Ala 

Lys 

Gly 

Gin 

Asp 

Glu 

Glu 

Thr 


130 





135 





140 





Ala 

Asp 

Gly 

Thr 

Lys 

Thr 

Ala 

Val 

lie 

Leu 

Ala 

Gly 

Glu 

Leu 

Ala 

Lys 

145 





150 





155 





160 

Lys 

Ala 

Glu 

Asp 

Leu 

Leu 

Tyr 

Lys 

Glu 

lie 

His 

Pro 

Thr 

lie 

lie 

Val 





165 





170 





175 


Ser 

Gly 

Tyr 

Lys 

Lys 

Ala 

Glu 

Glu 

lie 

Ala 

Leu 

Lys 

Thr 

lie 

Gin 

Asp 




180 





185 





190 



He 

Ala 

Gin 

Pro 

Val 

Ser 

lie 

Asn 

Asp 

Thr 

Asp 

Val 

Leu 

Arg 

Lys 

Val 



195 





200 





205 




Ala 

Leu 

Thr 

Ser 

Leu 

Gly 

Ser 

Lys 

Ala 

Val 

Ala 

Gly 

Ala 

Arg 

Glu 

Tyr 


210 





215 





220 





Leu 

Ala 

Asp 

Leu 

Val 

Val 

Lys 

Ala 

Val 

Ala 

Gin 

Val 

Ala 

Glu 

Leu 

Arg 

225 





230 





235 





240 

Gly Asp 

Lys 

Trp 

Tyr 

Val 

Asp 

Leu 

Asp 

Asn 

Val 

Gin 

lie 

Val 

Lys 

Lys 





245 





250 





255 


His 

Gly 

Gly 

Ser 

lie 

Asn 

Asp 

Thr 

Gin 

Leu 

Val 

Tyr 

Gly 

lie 

Val 

Val 




260 





265 





270 



Asp 

Lys 

Glu 

Val 

Val 

His 

Pro 

Gly 

Met 

Pro 

Lys 

Arg 

lie 

Glu 

Asn 

Ala 



275 





280 





285 




Lys 

lie 

Ala 

Leu 

Leu 

Asp 

Ala 

Ser 

Leu 

Glu 

Val 

Glu 

Lys 

Pro 

Glu 

Leu 


290 





295 





300 





Asp 

Ala 

Glu 

lie 

Arg 

lie 

Asn 

Asp 

Pro 

Thr 

Gin 

Met 

His 

Lys 

Phe 

Leu 

305 





310 





315 





320 

Glu 

Glu 

Glu 

Glu 

Asn 

lie 

Leu 

Lys 

Glu 

Lys 

Val 

Asp 

Lys 

lie 

Ala 

Ala 





325 





330 





335 


Thr 

Gly 

Ala 

Asn 

Val 

Val 

lie 

Ala 

Gin 

Lys 

Gly 

lie 

Asp 

Glu 

Val 

Ala 




340 





345 





350 



Gin 

His 

Tyr 

Leu 

Ala 

Lys 

Lys 

Gly 

lie 

Leu 

Ala 

Val 

Arg 

Arg 

Ala 

Lys 



355 





360 





365 




Lys 

Ser 

Asp 

Leu 

Glu 

Lys 

Leu 

Ala 

Arg 

Ala 

Thr 

Gly 

Gly 

Arg 

Val 

lie 


370 





375 





380 





Ser 

Asn 

lie 

Asp 

Glu 

Leu 

Thr 

Ser 

Gin 

Asp 

Leu 

Gly 

Tyr 

Ala 

Ala 

Leu 

385 





390 





395 





400 

Val 

Glu 

Glu 

Arg 

Lys 

Val 

Gly 

Glu 

Asp 

Lys 

lie 

Val 

Phe 

Val 

Glu 

Gly 





405 





410 





415 


Ala 

Lys 

Asn 

Pro 

Lys 

Ser 

Val 

Ser 

lie 

Leu 

lie 

Arg 

Gly 

Gly 

Leu 

Glu 




420 





425 





430 



Arg 

Val 

Val 

Asp 

Glu 

Thr 

Glu 

Arg 

Ala 

Leu 

Arg 

Asp 

Ala 

Leu 

Gly 

Thr 



435 





440 





445 




Val 

Ala 

Asp 

Val 

lie 

Arg 

Asp 

Gly 

Arg 

Ala 

Val 

Ala 

Gly 

Gly 

Gly 

Ala 


450 





455 





460 





Val 

Glu 

lie 

Glu 

lie 

Ala 

Lys 

Arg 

Leu 

Arg 

Lys 

Tyr 

Ala 

Pro 

Gin 

Val 

465 





470 





475 





480 


Gly Gly Lys Glu Gin Leu Ala lie Glu Ala Tyr Ala Asn Ala lie Glu 
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-continued 


485 490 495 


Gly 

Leu 

lie 

Met 

lie 

Leu 

Ala 

Glu 

Asn 

Ala 

Gly 

Leu 

Asp 

Pro 

lie 

Asp 




500 





505 





510 



Lys 

Leu 

Met 

Gin 

Leu 

Arg 

Ser 

Leu 

His 

Glu 

Asn 

Glu 

Thr 

Gly 

Ser 

Gly 



515 





520 





525 




Gly 

Ser 

Gly 

Ser 

Gly 

Val 

Ser 

Lys 

Gly 

Glu 

Glu 

Leu 

Phe 

Thr 

Gly 

Val 


530 





535 





540 





Val 

Pro 

lie 

Leu 

Val 

Glu 

Leu Asp 

Gly 

Asp 

Val 

Asn 

Gly 

His 

Lys 

Phe 

545 





550 





555 





560 

Ser 

Val 

Ser 

Gly 

Glu 

Gly 

Glu 

Gly 

Asp 

Ala 

Thr 

Tyr 

Gly 

Lys 

Leu 

Thr 





565 





570 





575 


Leu 

Lys 

Phe 

lie 

Cys 

Thr 

Thr 

Gly 

Lys 

Leu 

Pro 

Val 

Pro 

Trp 

Pro 

Thr 




580 





585 





590 



Leu 

Val 

Thr 

Thr 

Phe 

Gly 

Tyr 

Gly 

Leu 

Gin 

Cys 

Phe 

Ala 

Arg 

Tyr 

Pro 



595 





600 





605 




Asp 

His 

Met 

Lys 

Gin 

His 

Asp 

Phe 

Phe 

Lys 

Ser 

Ala 

Met 

Pro 

Glu 

Gly 


610 





615 





620 





Tyr 

Val 

Gin 

Glu 

Arg 

Thr 

lie 

Phe 

Phe 

Lys 

Asp 

Asp 

Gly 

Asn 

Tyr 

Lys 

625 





630 





635 





640 

Thr 

Arg 

Ala 

Glu 

Val 

Lys 

Phe 

Glu 

Gly 

Asp 

Thr 

Leu 

Val 

Asn 

Arg 

lie 





645 





650 





655 


Glu 

Leu 

Lys 

Gly 

lie 

Asp 

Phe 

Lys 

Glu 

Asp 

Gly 

Asn 

lie 

Leu 

Gly 

His 




660 





665 





670 



Lys 

Leu 

Glu 

Tyr 

Asn 

Gly 

Gly 

Thr 

Gly 

Ser 

Gly 

Gly 

Glu 

Leu 

Asn 

Ser 



675 





680 





685 




His 

Asn 

Val 

Tyr 

lie 

Met 

Ala 

Asp 

Lys 

Gin 

Lys 

Asn 

Gly 

lie 

Lys 

Val 


690 





695 





700 





Asn 

Phe 

Lys 

lie 

Arg 

His 

Asn 

lie 

Glu 

Asp 

Gly 

Ser 

Val 

Gin 

Leu 

Ala 

705 





710 





715 





720 

Asp 

His 

Tyr 

Gin 

Gin 

Asn 

Thr 

Pro 

lie 

Gly 

Asp 

Gly 

Pro 

Val 

Leu 

Leu 





725 





730 





735 


Pro 

Asp 

Asn 

His 

Tyr 

Leu 

Ser 

Tyr 

Gin 

Ser 

Ala 

Leu 

Ser 

Lys 

Asp 

Pro 




740 





745 





750 



Asn 

Glu 

Lys 

Arg 

Asp 

His 

Met 

Val 

Leu 

Leu 

Glu 

Phe 

Val 

Thr 

Ala 

Ala 



755 





760 





765 




Gly 

He 

Thr 

Leu 

Gly 

Met 

Asp 

Glu 

Leu 

Tyr 

Lys 

His 

His 

His 

His 

His 


770 775 780 


His 

785 


<210 > SEQ ID NO 80 
<211> LENGTH: 1680 
<212 > TYPE: DNA 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

<221> NAME/KEY: misc_f eature 

<223 > OTHER INFORMATION: TF55 alpha subunit 
<400 > SEQUENCE: 80 


atggcgagcc 

cggtcttgtt 

attgaaagag 

ggaacgagta 

gaactactgg 

tagagatgcg 

60 

ttaaggaata 

atatacttgc 

tgcaaagaca 

ctagccgaaa 

tgttaaggag 

tagtttaggt 

120 

cctaaaggtc 

ttgataaaat 

gttaattgat 

agtttcggtg 

acgtaaccat 

aactaatgat 

180 

ggtgctacaa 

tagtaaagga 

tatggagata 

cagcatccag 

cagcaaagct 

attagtagaa 

240 

gcagctaaag 

cacaagatgc 

tgaagtaggt 

gatggtacta 

caagcgctgt 

agtattagct 

300 
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ggtgctctat 

tggagaaago 

tgaaagttta 

ttggatcaaa 

atatacatcc 

aacaataatt 

360 

attgaggggt 

ataagaaggc 

atataccaag 

gccttggagt 

tacttccaca 

gttaggaact 

420 

aggatcgata 

taagggattt 

gaattcttca 

gttgctaggg 

atactctaag 

aaaaatagca 

480 

tttactacac 

tagcaagtaa 

gtttattgca 

gaaggtgctg 

aattaaataa 

aataattgac 

540 

atggtaatag 

acgcaatagt 

taatgttgca 

gaacctctac 

ctaatggtgg 

gtacaatgtg 

600 

agtttagact 

taataaagat 

agataagaag 

aaaggtggaa 

gtatagagga 

tagcgtctta 

660 

gttaaaggac 

tagtattaga 

taaggaggtt 

gtgcaccctg 

gaatgcctag 

aagagtcact 

720 

aaagccaaga 

tagctgtttt 

ggatgcagca 

ttagaggtag 

aaaagcctga 

aatctcagct 

780 

aagataagta 

taacatcacc 

agagcaaatc 

aaggctttct 

tagatgagga 

atccaaatat 

840 

cttaaggaca 

tggttgataa 

actagcgtca 

ataggcgcta 

acgttgtaat 

atgccagaaa 

900 

ggtattgatg 

atatcgcaca 

gcacttctta 

gctaagaaag 

gtatattggc 

tgtaagaagg 

960 

gttaagagga 

gcgatataga 

gaaattagag 

aaggcattag 

gcgcgagaat 

aataagcagt 

1020 

attaaagacg 

ctactcccga 

tgatttagga 

tacgccgaat 

tagttgagga 

aaggagagtt 

1080 

ggaaatgaca 

aaatggtatt 

tatagaaggt 

gctaagaatc 

tgaaggccgt 

gaatatcttg 

1140 

ttaagaggtt 

caaatgatat 

ggcattagat 

gaggctgaga 

ggagtataaa 

tgatgcattg 

1200 

catgctctga 

ggaacatatt 

attagagcca 

gtaatattgc 

caggtggtgg 

cgctatcgag 

1260 

ttagaattag 

cgatgaaatt 

aagagagtat 

gctagaagtg 

taggaggtaa 

ggagcaatta 

1320 

gctatagaag 

catttgcaga 

tgcattagag 

gagataccta 

tgattttagc 

tgaaactgca 

1380 

gggctggagg 

ctatatctgc 

actaatggac 

ttaagagcta 

gacacgctaa 

gggattaacc 

1440 

aatactggtg 

tagatgtaat 

aggtgggaag 

atcgtagacg 

atgtatatgc 

cttaaacatc 

1500 

attgagccta 

taagagtaaa 

ggctcaagtg 

ttaaagagcg 

caacagaggc 

ggctacagca 

1560 

atattaaaga 

ttgatgacct 

aatagctgca 

gctccattaa 

agagcgagaa 

gaaaggtgga 

1620 

gaaggaagta 

aagaagaaag 

tggtggagaa 

ggaggagotg 

gtactccatc 

tttaggagac 

1680 


< 2 10 > SEQ ID NO 81 
<211> LENGTH: 1272 
<212 > TYPE: DNA 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

< 22 1 > NAME/KEY: misc_f eature 

< 223 > OTHER INFORMATION: chaperonin dwarf deletion permutant 267 

< 4 00 > SEQUENCE: 81 


atggtattcg 

tagaaggtgc 

aaagaatcca 

aaatcagtta 

gtatactaat 

aagaggagga 

60 

ttagagagag 

tagtagatga 

gactgaaaga 

gctcttaggg 

acgctttagg 

tacagtggca 

120 

gatgtaataa 

gggatggtag 

agcagtagct 

ggtggtggag 

ctgttgagat 

agagatagct 

180 

aagagattaa 

gaaagtatgc 

cccacaagtt 

ggtggtaaag 

agcaattagc 

aattgaagca 

240 

tatgctaatg 

caatagaagg 

acttatcatg 

atattggcgg 

aaaacgcagg 

attagatcct 

300 

atagacaaat 

taatgcaatt 

aagaagtctt 

cacgagaatg 

agaccaataa 

atggtatgga 

360 

cttaatttat 

ttactggaaa 

tccagaggat 

atgtggaaat 

taggtgttat 

tgaaccggca 

420 

ctagtgaaaa 

tgaatgcaat 

taaggctgca 

acagaagcag 

taacattagt 

gttaagaata 

480 

gatgatattg 

taggtggttc 

tggtggtacc 

atacctgtaa 

taattttaaa 

agagggatca 

540 

agtagaacat 

atggaaaaga 

agctttaagg 

gctaatattg 

ctgcagtgaa 

agcaattgaa 

600 

gaggcattaa 

aaagcaccta 

tggtccacgt 

ggaatggata 

agatgcttgt 

tgatagctta 

660 
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ggagatatta 

caataacaaa 

tgatggagcc 

actattcttg 

ataaaatgga 

tttacaacac 

720 

ccaacaggta 

agcttttagt 

tcagatagct 

aaaggacaag 

acgaggaaac 

agctgatggc 

780 

actaaaactg 

ctgtaattct 

tgctggagaa 

ttagctaaaa 

aagcagaaga 

tcttttatat 

840 

aaggagattc 

acccaacaat 

aattgtaagc 

ggatataaga 

aggcagaaga 

aattgcatta 

900 

aagaccatcc 

aagatatagc 

acaaccggtc 

agcataaatg 

atactgacgt 

acttaggaaa 

960 

gtagcattaa 

catccttagg 

cagtaaggca 

gtagcaggcg 

cacgagagta 

tttagctgac 

1020 

cttgtggtta 

aagcagtggc 

acaagtagca 

gaattaagag 

gagataagtg 

gtatgttgat 

1080 

ctagataatg 

tacaaatagt 

taaaaaacat 

ggtggtagoa 

ttaatgatac 

tcaattagta 

1140 

tacggcatag 

tagttgataa 

ggaagttgta 

catccgggca 

tgccaaagag 

gattgaaaat 

1200 

gctaagatag 

cccttttaga 

cgcttcatta 

gaagttgaga 

aacccgaatt 

ggatgcagaa 

1260 

ataagaatta 

ac 





1272 


< 2 10 > SEQ ID NO 82 
< 2 11 > LENGTH: 424 
<212 > TYPE: PRT 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

<221 > NAME /KEY: MIS C_FE ATURE 

<223 > OTHER INFORMATION: chaperonin dwarf deletion permutant 267 

< 4 00 > SEQUENCE: 82 


Met 

Val 

Phe 

Val 

Glu 

Gly 

Ala 

Lys 

Asn 

Pro 

Lys 

Ser 

Val 

Ser 

lie 

Leu 

1 




5 





10 





15 


He 

Arg 

Gly 

Gly 

Leu 

Glu 

Arg 

Val 

Val 

Asp 

Glu 

Thr 

Glu 

Arg 

Ala 

Leu 




20 





25 





30 



Arg 

Asp 

Ala 

Leu 

Gly 

Thr 

Val 

Ala 

Asp 

Val 

lie 

Arg 

Asp 

Gly 

Arg 

Ala 



35 





40 





45 




Val 

Ala 

Gly 

Gly 

Gly 

Ala 

Val 

Glu 

lie 

Glu 

lie 

Ala 

Lys 

Arg 

Leu 

Arg 


50 





55 





60 





Lys 

Tyr 

Ala 

Pro 

Gin 

Val 

Gly 

Gly 

Lys 

Glu 

Gin 

Leu 

Ala 

lie 

Glu 

Ala 

65 





70 





75 





80 

Tyr 

Ala 

Asn 

Ala 

lie 

Glu 

Gly 

Leu 

lie 

Met 

lie 

Leu 

Ala 

Glu 

Asn 

Ala 





85 





90 





95 


Gly 

Leu 

Asp 

Pro 

lie 

Asp 

Lys 

Leu 

Met 

Gin 

Leu 

Arg 

Ser 

Leu 

His 

Glu 




100 





105 





110 



Asn 

Glu 

Thr 

Asn 

Lys 

Trp 

Tyr 

Gly 

Leu 

Asn 

Leu 

Phe 

Thr 

Gly 

Asn 

Pro 



115 





120 





125 




Glu 

Asp 

Met 

Trp 

Lys 

Leu 

Gly 

Val 

lie 

Glu 

Pro 

Ala 

Leu 

Val 

Lys 

Met 


130 





135 





140 





Asn 

Ala 

lie 

Lys 

Ala 

Ala 

Thr 

Glu 

Ala 

Val 

Thr 

Leu 

Val 

Leu 

Arg 

lie 

145 





150 





155 





160 

Asp 

Asp 

lie 

Val 

Gly 

Gly 

Ser 

Gly 

Gly 

Thr 

lie 

Pro 

Val 

lie 

lie 

Leu 





165 





170 





175 


Lys 

Glu 

Gly 

Ser 

Ser 

Arg 

Thr 

Tyr 

Gly 

Lys 

Glu 

Ala 

Leu 

Arg 

Ala 

Asn 




180 





185 





190 



lie 

Ala 

Ala 

Val 

Lys 

Ala 

lie 

Glu 

Glu 

Ala 

Leu 

Lys 

Ser 

Thr 

Tyr 

Gly 



195 





200 





205 




Pro 

Arg 

Gly 

Met 

Asp 

Lys 

Met 

Leu 

Val 

Asp 

Ser 

Leu 

Gly 

Asp 

lie 

Thr 


210 





215 





220 





lie 

Thr 

Asn 

Asp 

Gly 

Ala 

Thr 

lie 

Leu 

Asp 

Lys 

Met 

Asp 

Leu 

Gin 

His 

225 





230 





235 





240 
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-continued 


Pro 

Thr 

Gly 

Lys 

Leu 

Leu 

Val 

Gin 

lie 

Ala 

Lys 

Gly 

Gin 

Asp 

Glu 

Glu 





245 





250 





255 


Thr 

Ala 

Asp 

Gly 

Thr 

Lys 

Thr 

Ala 

Val 

lie 

Leu 

Ala 

Gly 

Glu 

Leu 

Ala 




260 





265 





270 



Lys 

Lys 

Ala 

Glu 

Asp 

Leu 

Leu 

Tyr 

Lys 

Glu 

lie 

His 

Pro 

Thr 

lie 

lie 



275 





280 





285 




Val 

Ser 

Gly 

Tyr 

Lys 

Lys 

Ala 

Glu 

Glu 

lie 

Ala 

Leu 

Lys 

Thr 

lie 

Gin 


290 





295 





300 





Asp 

He 

Ala 

Gin 

Pro 

Val 

Ser 

lie 

Asn 

Asp 

Thr 

Asp 

Val 

Leu 

Arg 

Lys 

305 





310 





315 





320 

Val 

Ala 

Leu 

Thr 

Ser 

Leu 

Gly 

Ser 

Lys 

Ala 

Val 

Ala 

Gly 

Ala 

Arg 

Glu 





325 





330 





335 


Tyr 

Leu 

Ala 

Asp 

Leu 

Val 

Val 

Lys 

Ala 

Val 

Ala 

Gin 

Val 

Ala 

Glu 

Leu 




340 





345 





350 



Arg 

Gly 

Asp 

Lys 

Trp 

Tyr 

Val 

Asp 

Leu 

Asp 

Asn 

Val 

Gin 

lie 

Val 

Lys 



355 





360 





365 




Lys 

His 

Gly 

Gly 

Ser 

lie 

Asn 

Asp 

Thr 

Gin 

Leu 

Val 

Tyr 

Gly 

lie 

Val 


370 





375 





380 





Val 

Asp 

Lys 

Glu 

Val 

Val 

His 

Pro 

Gly 

Met 

Pro 

Lys 

Arg 

lie 

Glu 

Asn 

385 





390 





395 





400 

Ala 

Lys 

lie 

Ala 

Leu 

Leu 

Asp 

Ala 

Ser 

Leu 

Glu 

Val 

Glu 

Lys 

Pro 

Glu 





405 





410 





415 


Leu 

Asp 

Ala 

Glu 

lie 

Arg 

lie 

Asn 












420 














< 2 10 > SEQ ID NO 83 
<211 > LENGTH: 6 
<212 > TYPE : PRT 

<213 > ORGANISM: Artificial 
<220 > FEATURE: 

<223 > OTHER INFORMATION: peptide linker 267 EYFP 

< 4 00 > SEQUENCE: 83 

Gly Ser Gly Gly Ser Gly 
1 5 


< 2 10 > SEQ ID NO 84 
<211 > LENGTH: 8 
<212 > TYPE: PRT 

<213 > ORGANISM: Artificial 
<220 > FEATURE: 

<223 > OTHER INFORMATION: peptide linker 267 EYFP 

< 4 00 > SEQUENCE: 84 

Gly Ser Gly Gly Ser Gly Ser Gly 
1 5 


<210 > SEQ ID NO 85 
<211> LENGTH: 1656 
<212 > TYPE: DNA 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

<221> NAME/KEY: misc_feature 

<223 > OTHER INFORMATION: TF55 beta subunit (HENET) 
<400 > SEQUENCE: 85 


atggcaacag 

ctacagttgc 

aactacaccc 

gaaggtatac 

ctgtaataat 

tttaaaagag 

60 

ggatcaagta 

gaacatatgg 

aaaagaagct 

ttaagggcta 

atattgctgc 

agtgaaagca 

120 

attgaagagg 

cattaaaaag 

cacctatggt 

ccacgtggaa 

tggataagat 

gttcgttgat 

180 
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164 


-continued 


agcttaggag 

atattacaat 

aacaaatgat 

ggagccacta 

ttcttgataa 

aatggattta 

240 

caacacccaa 

caggtaagct 

tttagttcag 

atagctaaag 

gacaagacga 

ggaaacagct 

300 

gatggcacta 

aaactgctgt 

aattcttgct 

ggagaattag 

ctaaaaaagc 

agaagatctt 

360 

ttatataagg 

agattcaccc 

aacaataatt 

gtaagcggat 

ataagaaggc 

agaagaaatt 

420 

gcattaaaga 

ccatccaaga 

tatagcacaa 

ccggtcagca 

taaatgatac 

tgacgtactt 

480 

aggaaagtag 

cattaacatc 

cttaggcagt 

aaggcagtag 

caggcgcacg 

agagtattta 

540 

gctgaccttg 

tggttaaagc 

agtggcacaa 

gtagcagaat 

taagaggaga 

taagtggtat 

600 

gttgatctag 

ataatgtaca 

aatagttaaa 

aaacatggtg 

gtagcattaa 

tgatactcaa 

660 

ttagtatacg 

gcatagtagt 

tgataaggaa 

gttgtacatc 

cgggcatgcc 

aaagaggatt 

720 

gaaaatgcta 

agatagccct 

tttagacgct 

tcattagaag 

ttgagaaacc 

cgaattggat 

780 

gcagaaataa 

gaattaacga 

tccaacacag 

atgcacaaat 

tcttggaaga 

agaagaaaac 

840 

atattgaaag 

aaaaagtaga 

taagattgca 

gctactggtg 

ctaacgttgt 

aatatgccag 

900 

aaaggtatcg 

atgaagttgc 

acaacactat 

ttagctaaga 

aaggtatatt 

agctgttagg 

960 

agagccaaga 

agagtgattt 

agagaaatta 

gctagagcta 

ccggaggtag 

agtcatatca 

1020 

aatattgatg 

aattaacttc 

acaagatcta 

ggttatgccg 

cattagtgga 

agagagaaaa 

1080 

gtaggagagg 

ataagatggt 

attcgtagaa 

ggtgcaaaga 

atccaaaatc 

agttagtata 

1140 

ctaataagag 

gaggattaga 

gagagtagta 

gatgagactg 

aaagagctct 

tagggacgct 

1200 

ttaggtacag 

tggcagatgt 

aataagggat 

ggtagagcag 

tagctggtgg 

tggagctgtt 

1260 

gagatagaga 

tagctaagag 

attaagaaag 

tatgccccac 

aagttggtgg 

taaagagcaa 

1320 

ttagcaattg 

aagcatatgc 

taatgcaata 

gaaggactta 

tcatgatatt 

ggcggaaaac 

1380 

gcaggattag 

atcctataga 

caaattaatg 

caattaagaa 

gtcttcacga 

gaatgagacc 

1440 

aataaatggt 

atggacttaa 

tttatttact 

ggaaatccag 

aggatatgtg 

gaaattaggt 

1500 

gttattgaac 

cggcactagt 

gaaaatgaat 

gcaattaagg 

ctgcaacaga 

agcagtaaca 

1560 

ttagtgttaa 

gaatagatga 

tattgtagca 

gctggaaaga 

agggtggaag 

tgagccaggc 

1620 

ggtaagaaag 

agaaagaaga 

aaagtcctct 

gaagac 



1656 

< 2 10 > SEQ ID NO 86 
< 2 11 > LENGTH: 1656 
<212 > TYPE: DNA 

<213 > ORGANISM: Sulfolobus shibatae 
<220 > FEATURE: 

<221> NAME/KEY: misc_feature 

<223 > OTHER INFORMATION: TF55 beta subunit 

(HDNET) 



< 4 00 > SEQUENCE: 86 






atggcaacag 

ctacagttgc 

aactacaccc 

gaaggtatac 

ctgtaataat 

tttaaaagag 

60 

ggatcaagta 

gaacatatgg 

aaaagaagct 

ttaagggcta 

atattgctgc 

agtgaaagca 

120 

attgaagagg 

cattaaaaag 

cacctatggt 

ccacgtggaa 

tggataagat 

gttcgttgat 

180 

agcttaggag 

atattacaat 

aacaaatgat 

ggagccacta 

ttcttgataa 

aatggattta 

240 

caacacccaa 

caggtaagct 

tttagttcag 

atagctaaag 

gacaagacga 

ggaaacagct 

300 

gatggcacta 

aaactgctgt 

aattcttgct 

ggagaattag 

ctaaaaaagc 

agaagatctt 

360 

ttatataagg 

agattcaccc 

aacaataatt 

gtaagcggat 

ataagaaggc 

agaagaaatt 

420 

gcattaaaga 

ccatccaaga 

tatagcacaa 

ccggtcagca 

taaatgatac 

tgacgtactt 

480 

aggaaagtag 

cattaacatc 

cttaggcagt 

aaggcagtag 

caggcgcacg 

agagtattta 

540 
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166 


-continued 


gctgaccttg 

tggttaaagc 

agtggcacaa 

gtagcagaat 

taagaggaga 

taagtggtat 

600 

gttgatctag 

ataatgtaca 

aatagttaaa 

aaacatggtg 

gtagcattaa 

tgatactcaa 

660 

ttagtatacg 

gcatagtagt 

tgataaggaa 

gttgtacatc 

cgggcatgcc 

aaagaggatt 

720 

gaaaatgcta 

agatagccct 

tttagacgct 

tcattagaag 

ttgagaaacc 

cgaattggat 

780 

gcagaaataa 

gaattaacga 

tccaacacag 

atgcacaaat 

tcttggaaga 

agaagaaaac 

840 

atattgaaag 

aaaaagtaga 

taagattgca 

gctactggtg 

ctaacgttgt 

aatatgccag 

900 

aaaggtatcg 

atgaagttgc 

acaacactat 

ttagctaaga 

aaggtatatt 

agctgttagg 

960 

agagccaaga 

agagtgattt 

agagaaatta 

gctagagcta 

ccggaggtag 

agtcatatca 

1020 

aatattgatg 

aattaacttc 

acaagatcta 

ggttatgccg 

cattagtgga 

agagagaaaa 

1080 

gtaggagagg 

ataagatggt 

attcgtagaa 

ggtgcaaaga 

atccaaaatc 

agttagtata 

1140 

ctaataagag 

gaggattaga 

gagagtagta 

gatgagactg 

aaagagctct 

tagggacgct 

1200 

ttaggtacag 

tggcagatgt 

aataagggat 

ggtagagcag 

tagctggtgg 

tggagctgtt 

1260 

gagatagaga 

tagctaagag 

attaagaaag 

tatgccccac 

aagttggtgg 

taaagagcaa 

1320 

ttagcaattg 

aagcatatgc 

taatgcaata 

gaaggactta 

tcatgatatt 

ggcggaaaac 

1380 

gcaggattag 

atcctataga 

caaattaatg 

caattaagaa 

gtcttcacga 

caatgagacc 

1440 

aataaatggt 

atggacttaa 

tttatttact 

ggaaatccag 

aggatatgtg 

gaaattaggt 

1500 

gttattgaac 

cggcactagt 

gaaaatgaat 

gcaattaagg 

ctgcaacaga 

agcagtaaca 

1560 

ttagtgttaa 

gaatagatga 

tattgtagca 

gctggaaaga 

agggtggaag 

tgagccaggc 

1620 

ggtaagaaag 

agaaagaaga 

aaagtcctct 

gaagac 



1656 


< 2 10 > SEQ ID NO 87 
< 2 11 > LENGTH: 28 
<212 > TYPE : PRT 

<213> ORGANISM: Artificial Sequence 
<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Au 
< 4 00 > SEQUENCE: 87 

Pro Gly Met Lys Ala Ser Lys Ser Met Arg Asn Gin Ala Thr Pro Gly 
15 10 15 

Met Pro Ser Ser Leu Asp Leu Thr Trp Gin Ala Thr 
20 25 


< 2 10 > SEQ ID NO 88 
<211 > LENGTH: 28 
<212 > TYPE: PRT 

<213 > ORGANISM: Artificial Sequence 
<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Au 
<400 > SEQUENCE: 88 

Pro Gly Met Lys Met Arg Leu Ser Gly Ala Lys Glu Ala Thr Pro Gly 
15 10 15 

Met Ser Thr Thr Val Ala Gly Leu Leu Gin Ala Thr 
20 25 


<210 > SEQ ID NO 89 
<211> LENGTH: 28 
<212 > TYPE: PRT 

<213 > ORGANISM: Artificial Sequence 
<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Au 


<400 > SEQUENCE: 89 
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168 


-continued 


Pro Gly Met He His Val Gin Lys Thr Ala Val Gin Ala Thr Pro Gly 
15 10 15 

Met Val Asn Leu Thr Ser Pro Val Lys Gin Ala Thr 
20 25 


<210 > SEQ ID NO 90 
<211 > LENGTH: 14 
<212 > TYPE: PRT 

<213 > ORGANISM: Artificial Sequence 
<220 > FEATURE: 

<223 > OTHER INFORMATION: metal binding peptide - Au 
< 4 00 > SEQUENCE: 90 

Ala Leu Asp Ser Pro Ala Gly Cys Leu Ser Phe Ser Met His 
15 10 


< 2 10 > SEQ ID NO 91 
< 2 11 > LENGTH: 246 
<212 > TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220 > FEATURE: 

<223 > OTHER INFORMATION: EYFP Yellow Fluroescent Protein 


< 4 00 > SEQUENCE: 

91 












Val 

Ser 

Lys 

Gly 

Glu 

Glu 

Leu 

Phe 

Thr 

Gly 

Val 

Val 

Pro 

lie 

Leu 

Val 

1 




5 





10 





15 


Glu 

Leu 

Asp 

Gly 

Asp 

Val 

Asn 

Gly 

His 

Lys 

Phe 

Ser 

Val 

Ser 

Gly 

Glu 




20 





25 





30 



Gly 

Glu 

Gly 

Asp 

Ala 

Thr 

Tyr 

Gly 

Lys 

Leu 

Thr 

Leu 

Lys 

Phe 

lie 

Cys 



35 





40 





45 




Thr 

Thr 

Gly 

Lys 

Leu 

Pro 

Val 

Pro 

Trp 

Pro 

Thr 

Leu 

Val 

Thr 

Thr 

Phe 


50 





55 





60 





Gly 

Tyr 

Gly 

Leu 

Gin 

Cys 

Phe 

Ala 

Arg 

Tyr 

Pro 

Asp 

His 

Met 

Lys 

Gin 

65 





70 





75 





80 

His 

Asp 

Phe 

Phe 

Lys 

Ser 

Ala 

Met 

Pro 

Glu 

Gly 

Tyr 

Val 

Gin 

Glu 

Arg 





85 





90 





95 


Thr 

lie 

Phe 

Phe 

Lys 

Asp 

Asp 

Gly 

Asn 

Tyr 

Lys 

Thr 

Arg 

Ala 

Glu 

Val 




100 





105 





110 



Lys 

Phe 

Glu 

Gly 

Asp 

Thr 

Leu 

Val 

Asn 

Arg 

lie 

Glu 

Leu 

Lys 

Gly 

lie 



115 





120 





125 




Asp 

Phe 

Lys 

Glu 

Asp 

Gly 

Asn 

lie 

Leu 

Gly 

His 

Lys 

Leu 

Glu 

Tyr 

Asn 


130 





135 





140 





Gly Gly 

Thr 

Gly 

Ser 

Gly 

Gly 

Glu 

Leu 

Asn 

Ser 

His 

Asn 

Val 

Tyr 

lie 

145 





150 





155 





160 

Met 

Ala 

Asp 

Lys 

Gin 

Lys 

Asn 

Gly 

lie 

Lys 

Val 

Asn 

Phe 

Lys 

lie 

Arg 





165 





170 





175 


His 

Asn 

lie 

Glu 

Asp 

Gly 

Ser 

Val 

Gin 

Leu 

Ala 

Asp 

His 

Tyr 

Gin 

Gin 




180 





185 





190 



Asn 

Thr 

Pro 

lie 

Gly 

Asp 

Gly 

Pro 

Val 

Leu 

Leu 

Pro 

Asp 

Asn 

His 

Tyr 



195 





200 





205 




Leu 

Ser 

Tyr 

Gin 

Ser 

Ala 

Leu 

Ser 

Lys 

Asp 

Pro 

Asn 

Glu 

Lys 

Arg 

Asp 


210 





215 





220 





His 

Met 

Val 

Leu 

Leu 

Glu 

Phe 

Val 

Thr 

Ala 

Ala 

Gly 

lie 

Thr 

Leu 

Gly 

225 





230 





235 





240 


Met Asp Glu Leu Tyr Lys 
245 
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What is claimed: 

1. A circularly permuted chaperonin peptide molecule, 
comprising a new carboxyl-terminal amino acid and a new 
amino -terminal amino acid and a reordered chaperonin 
polypeptide sequence, wherein the circularly permuted chap- 
eronin polypeptide molecule comprises a chaperonin 
polypeptide molecule which comprises a native carboxyl- 
terminal amino acid and a native amino -terminal amino acid, 
wherein the native terminal amino acids are joined directly 
together or joined via an intervening linker polypeptide 
sequence, and wherein the chaperonin polypeptide molecule, 
so joined, is cleaved at a location that differs from the location 
of the joined native terminal amino acids so as to generate a 
new carboxyl terminal amino acid and a new amino terminal 
amino acid and a reordered chaperonin polypeptide 
sequence, and wherein the chaperonin polypeptide molecule 
comprises a Sulfolobus shibatae thermophilic factor 55 
(TF55) beta polypeptide according to any one of SEQ ID 
NOS:69, 67, 71, 73, or 75. 

2. The circularly permuted chaperonin polypeptide mol- 
ecule of claim 1 comprising an intervening linker polypeptide 
sequence which joins the native carboxyl terminal amino acid 
with the native amino terminal amino acid. 

3. The circularly permuted chaperonin polypeptide mol- 
ecule of claim 2, wherein the intervening linker polypeptide 


170 

sequence comprises the amino acid sequence Gly-Gly-Ser- 
Gly-Gly-Thr according to SEQ ID NO: 64. 

4 . The circularly permuted chaperonin polypeptide mol- 
ecule of claim 1, which is a fusion protein, wherein said new 
carboxyl terminal amino acid or said new amino terminal 
amino acid is joined with a second polypeptide. 

5 . The circularly permuted chaperonin polypeptide mol- 
ecule of claim 4 , wherein said second polypeptide is a gold- 
binding peptide comprising the amino acid sequence SEQ ID 
NO:31. 

6. The circularly permuted chaperonin polypeptide mol- 
ecule of claim 4 , wherein said second polypeptide is a yellow 
fluorescent protein (EYFP) according to SEQ ID NO: 91 . 

7 . The circularly permuted chaperonin polypeptide mol- 
ecule of claim 1, comprising a Sulfolobus shibatae TF55 beta 
polypeptide according to SEQ ID NO: 67. 

8. The circularly permuted chaperonin polypeptide mol- 
ecule of claim 1, comprising a Sulfolobus shibatae TF55 beta 
polypeptide according to SEQ ID NO: 7 1 . 

9. The circularly permuted chaperonin polypeptide mol- 
ecule of claim 1, comprising a Sulfolobus shibatae TF55 beta 
polypeptide according to SEQ ID NO: 73. 

10. The circularly permuted chaperonin polypeptide mol- 
ecule of claim 1, comprising a Sulfolobus shibatae TF55 beta 
polypeptide according to SEQ ID NO:75. 



